NOVEL COMPOUNDS

Field of the Invention: 

This invention relates to newly identified polynucleotides, polypeptides encoded by 
such polynucleotides, the uses of such polynucleotides and polypeptides, as well as the 
production of such polynucleotides and polypeptides and recombinant host cells
transformed with the polynucleotides. This invention also relates to inhibiting the 
biosynthesis or action of such polynucleotides or polypeptides and to the use of such 
inhibitors in therapy. Preferred embodiments of the invention include Streptococcal 
polypeptides and polynucleotides, particularly those of Streptococcus pneumoniae. 

Background of the Invention:

The Streptococci make up a medically important genus of microbes known to
cause several types of disease in humans, including otitis media, pneumonia and meningitis.
Since its isolation more than 100 years ago, Streptococcus pneumoniae has been one of the
more intensively studied microbes. For example, much of our early understanding that
DNA is, in fact, the genetic material was predicated on the work of Griffith and of Avery,
Macleod and McCarty using this microbe. Despite the vast amount of research with S.
pneumoniae, many questions concerning the virulence of this microbe remain.

While certain Streptococcal factors associated with pathogenicity have been
identified, e.g., capsule polysaccharides, peptidoglycans, pneumolysins, PspA Complement
factor H binding component, autolysin, neuraminidase, peptide permeases, hydrogen
peroxide and IgA1 protease, the list is certainly not complete. Further, very little is known
concerning the temporal expression of such genes during infection and disease progression
in a mammalian host. Discovering the sets of genes the bacterium is likely to be expressing
at the different stages of infection, particularly when an infection is established, provides
critical information for the screening and characterization of novel antibacterials which can
interrupt pathogenesis. In addition to providing a fuller understanding of known proteins,
such an approach will identify previously unrecognised targets.
Brief Description of the Invention: 

This invention provides novel proteins, particularly those from Streptococcus
pneumoniae, strain 0100993, characterised in that they comprise the amino acid sequences
given herein or fragments, analogues or derivatives thereof.

In accordance with another aspect of the present invention, there are provided
polynucleotides (DNA or RNA) which encode such polypeptides.

In particular the invention provides polynucleotides having the DNA sequences
given herein.

The invention also relates to novel oligonucleotides derived from the sequences
given herein which can act as PCR primers in the process herein described to determine
whether or not the Streptococcus pneumoniae gene identified herein in whole or in part is
expressed in infected tissue. It is recognised that such sequences will also have utility in
diagnosis of the stage of infection and type of infection the pathogen has attained. The
proteins so identified are also useful as targets in screens designed to identify antimicrobial
compounds.

It is an object of the invention to provide polypeptides that have been identified as novel
polypeptides by homology between the amino acid sequence set out in the Sequence Listing and
a known amino acid sequence or sequences of other proteins such as the proteins identified under
the heading Identity in Table 1.

It is a further object of the invention to provide polynucleotides that encode polypeptides
set forth in the Sequence Listing, particularly polynucleotides that encode the polypeptide set
forth in the Sequence Listing.

In a particularly preferred embodiment of the invention the polynucleotide comprises a 
region encoding polypeptides comprising the sequence set out in the Sequence Listing, or a 
variant thereof. 

In another particularly preferred embodiment of the invention there is a novel
protein comprising an amino acid sequence of the Sequence Listing, or a variant
thereof.

In accordance with another aspect of the invention, there is provided the use of a
polynucleotide of the invention for therapeutic or prophylactic purposes, in particular
genetic immunization. Among the particularly preferred embodiments of the invention are
naturally occurring allelic variants of the polynucleotides set forth in the Sequence Listing and
polypeptides encoded thereby.

In accordance with yet another aspect of the invention, there are provided inhibitors
to such polypeptides, useful as antibacterial agents, including, for example, antibodies.

In accordance with certain preferred embodiments of the invention, there are provided
products, compositions and methods for assessing expression of the sequences of the Sequence
Listing, treating disease, for example, otitis media, conjunctivitis, pneumonia, bacteremia,
meningitis, sinusitis, pleural empyema and endocarditis, and most particularly meningitis,
such as for example infection of cerebrospinal fluid, assaying genetic variation, and
administering a polypeptide or polynucleotide of the invention to an organism to raise an
immunological response against bacteria, especially Streptococcus pneumoniae bacteria.

In accordance with certain preferred embodiments of this and other aspects of the
invention there are provided polynucleotides that hybridize to polynucleotide sequences of the
invention, particularly under stringent conditions.

In certain preferred embodiments of the invention there are provided antibodies against 
polypeptides of the invention. 

In other embodiments of the invention there are provided methods for identifying
compounds which bind to or otherwise interact with and inhibit or activate an activity of a
polypeptide or polynucleotide of the invention comprising: contacting a polypeptide or
polynucleotide of the invention with a compound to be screened under conditions to permit
binding to or other interaction between the compound and the polypeptide or polynucleotide to
assess the binding to or other interaction with the compound, such binding or interaction being
associated with a second component capable of providing a detectable signal in response to the
binding or interaction of the polypeptide or polynucleotide with the compound; and determining
whether the compound binds to or otherwise interacts with and activates or inhibits an activity of
the polypeptide or polynucleotide by detecting the presence or absence of a signal generated from
the binding or interaction of the compound with the polypeptide or polynucleotide.

In accordance with yet another aspect of the invention, there are provided agonists and
antagonists of the polynucleotides and/or polypeptides of the invention, preferably bacteriostatic
or bactericidal agonists and antagonists.

In a further aspect of the invention there are provided compositions comprising a 
polynucleotide or a polypeptide of the invention for administration to a cell or to a multicellular 
organism. 

Various changes and modifications within the spirit and scope of the disclosed invention
will become readily apparent to those skilled in the art from reading the following descriptions
and from reading the other parts of the present disclosure.
Detailed Description of the Invention:

Each of the DNA sequences provided herein may be used in the discovery and
development of antibacterial compounds. Because each of the sequences contains an open
reading frame (ORF) with appropriate initiation and termination codons, the encoded
protein upon expression can be used as a target for the screening of antimicrobial drugs.
Additionally, the DNA sequences encoding preferably the amino terminal regions of the
encoded protein or the Shine-Dalgarno region can be used to construct antisense sequences
to control the expression of the coding sequence of interest. Furthermore, many of the
sequences disclosed herein also provide regions upstream and downstream from the
encoding sequence. These sequences are useful as a source of regulatory elements for the
control of bacterial gene expression. Such sequences are conveniently isolated by
restriction enzyme action or synthesized chemically and introduced, for example, into
promoter identification strains. These strains contain a reporter structural gene sequence
located downstream from a restriction site such that if an active promoter is inserted, the
reporter gene will be expressed.
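
By way of illustration only, the notions of an ORF bounded by initiation and termination codons and of an antisense sequence directed at the start of the coding region can be sketched as follows. The function names, the window sizes and the toy sequence are hypothetical and are not taken from the Sequence Listing; the sketch assumes ATG as the only initiation codon and scans one strand only.

```python
# Illustrative sketch; names, defaults and the toy sequence are assumptions.
COMPLEMENT = str.maketrans("ACGT", "TGCA")
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(dna):
    """Return the antisense (reverse-complement) strand of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def find_orfs(dna, min_codons=3):
    """Return sorted (start, end) index pairs of ATG...stop ORFs on this strand.

    'end' is exclusive and includes the termination codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first initiation codon in this frame
            elif start is not None and codon in STOP_CODONS:
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None  # look for the next ORF in this frame
    return sorted(orfs)


def antisense_to_start(dna, orf_start, upstream=10, downstream=12):
    """Antisense sequence spanning the region just upstream of the ORF
    (where a ribosome binding site would lie) and its first codons."""
    lo = max(0, orf_start - upstream)
    return reverse_complement(dna[lo:orf_start + downstream])


toy = "GGAGGTTTTATGAAACCCGGGTAACC"  # hypothetical sequence
print(find_orfs(toy))
print(antisense_to_start(toy, 9, upstream=9, downstream=6))
```

A real analysis would also scan the complementary strand and consider alternative initiation codons; those refinements are omitted here for brevity.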

Although each of the sequences may be employed as described above, this
invention also provides several means for identifying particularly useful target genes. The
first of these approaches entails searching appropriate databases for sequence matches in
related organisms. Thus, if a homologue exists, the Streptococcal-like form of this gene
would likely play an analogous role. For example, a Streptococcal protein identified as
homologous to a cell surface protein in another organism would be useful as a vaccine
candidate. To the extent such homologies have been identified for the sequences disclosed
herein they are reported along with the encoding sequence.

Recently techniques have become available to evaluate temporal gene expression in
bacteria, particularly as it applies to viability under laboratory and infection conditions. A
number of methods can be used to identify genes which are essential to survival per se, or
essential to the establishment/maintenance of an infection. Identification of an ORF
unknown by one of these methods yields additional information about its function and
permits the selection of such an ORF for further development as a screening target. Briefly,
these approaches include:
1) Signature Tagged Mutagenesis (STM) 

This technique is described by Hensel et al., Science 269: 400-403 (1995), the
contents of which are incorporated by reference for background purposes.
Signature tagged mutagenesis identifies genes necessary for the establishment/maintenance
of infection in a given infection model.

The basis of the technique is the random mutagenesis of the target organism by various
means (e.g., transposons) such that unique DNA sequence tags are inserted in close
proximity to the site of mutation. The tags from a mixed population of bacterial mutants
and bacteria recovered from infected hosts are detected by amplification, radiolabeling
and hybridisation analysis. Mutants attenuated in virulence are revealed by absence of the
tag from the pool of bacteria recovered from infected hosts.
In Streptococcus pneumoniae, because the transposon system is less well
developed, a more efficient way of creating the tagged mutants is to use the insertion-duplication
mutagenesis technique as described by Morrison et al., J. Bacteriol. 159:870
(1984), the contents of which are incorporated by reference for background purposes.
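
The comparison at the heart of this approach, namely that an attenuated mutant is one whose tag is detected in the inoculated pool but not in the pool recovered from the host, can be sketched as a simple set difference. The function name and the tag labels below are hypothetical and serve only to illustrate the logic; they do not describe any particular detection chemistry.

```python
# Illustrative sketch; tag identifiers are invented for the example.
def attenuated_tags(inoculum_tags, recovered_tags):
    """Return tags present in the inoculum but absent after passage
    through the infected host, i.e. candidate attenuated mutants."""
    return sorted(set(inoculum_tags) - set(recovered_tags))


inoculum = ["tag01", "tag02", "tag03", "tag04"]
recovered = ["tag01", "tag03"]
print(attenuated_tags(inoculum, recovered))
```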
2) In Vivo Expression Technology (IVET)

This technique is described by Camilli et al., Proc. Nat'l. Acad. Sci. USA 91:2634-
2638 (1994), the contents of which are incorporated by reference for background purposes.
IVET identifies genes up-regulated during infection when compared to laboratory
cultivation, implying an important role in infection. ORFs identified by this technique are
implied to have a significant role in infection establishment/maintenance.

In this technique random chromosomal fragments of the target organism are cloned
upstream of a promoter-less recombinase gene in a plasmid vector. This construct is
introduced into the target organism which carries an antibiotic resistance gene flanked by
resolvase sites. Growth in the presence of the antibiotic removes from the population those
fragments cloned into the plasmid vector which are capable of supporting transcription of the
recombinase gene and which have therefore caused loss of antibiotic resistance. The resistant pool
is introduced into a host and at various times after infection bacteria may be recovered and
assessed for the presence of antibiotic resistance. The chromosomal fragment carried by
each antibiotic sensitive bacterium should carry a promoter or portion of a gene normally
upregulated during infection. Sequencing upstream of the recombinase gene allows
identification of the upregulated gene.
3) Differential display

This technique is described by Chuang et al., J. Bacteriol. 175:2026-2036 (1993),
the contents of which are incorporated by reference for background purposes. This method
identifies those genes which are expressed in an organism by identifying mRNA present
using randomly-primed RT-PCR. By comparing pre-infection and post-infection profiles,
genes up and down regulated during infection can be identified and the RT-PCR product
sequenced and matched to ORF 'unknowns'.



4) Generation of conditional lethal mutants by transposon mutagenesis.

This technique, described by de Lorenzo, V., Gene 123:17-24 (1993);
Neuwald, A. F. et al., Gene 125: 69-73 (1993); and Takiff, H. E. et al., J. Bacteriol.
174:1544-1553 (1992), the contents of which are incorporated by reference for
background purposes, identifies genes whose expression is essential for cell viability.

In this technique transposons carrying controllable promoters, which provide
transcription outward from the transposon in one or both directions, are generated. Random
insertion of these transposons into target organisms and subsequent isolation of insertion
mutants in the presence of an inducer of promoter activity ensures that insertions which
separate the promoter from the coding region of a gene whose expression is essential for cell
viability will be recovered. Subsequent replica plating in the absence of inducer identifies
such insertions, since they fail to survive. Sequencing of the flanking regions of the
transposon allows identification of the site of insertion and identification of the gene disrupted.
Close monitoring of the changes in cellular processes/morphology during growth in the
absence of inducer yields information on the likely function of the gene. Such monitoring
could include flow cytometry (cell division, lysis, redox potential, DNA replication),
incorporation of radiochemically labeled precursors into DNA, RNA, protein, lipid or
peptidoglycan, and monitoring of reporter enzyme gene fusions which respond to known cellular
stresses.

5) Generation of conditional lethal mutants by chemical mutagenesis.

This technique is described by Beckwith, J., Methods in Enzymology 204:
3-18 (1991), the contents of which are incorporated herein by reference for background
purposes. In this technique random chemical mutagenesis of the target organism, growth at
a temperature other than physiological temperature (permissive temperature) and subsequent
replica plating and growth at a different temperature (e.g. 42°C to identify ts, 25°C to identify
cs) are used to identify those isolates which now fail to grow (conditional mutants). As
above, close monitoring of the changes upon growth at the non-permissive temperature
yields information on the function of the mutated gene. Complementation of the conditional
lethal mutation by a library from the target organism and sequencing of the complementing gene
allows matching with an unknown ORF.
6) RT-PCR 

Streptococcus pneumoniae messenger RNA is isolated from bacterial infected tissue, e.g. 48
hour murine lung infections, and the amount of each mRNA species assessed by reverse
transcription of the RNA sample primed with random hexanucleotides followed by PCR
with gene specific primer pairs. The determination of the presence and amount of a
particular mRNA species by quantification of the resultant PCR product provides
information on the bacterial genes which are transcribed in the infected tissue. Analysis of
gene transcription can be carried out at different times of infection to gain a detailed
knowledge of gene regulation in bacterial pathogenesis allowing for a clearer understanding
of which gene products represent targets for screens for novel antibacterials. Because of
the gene specific nature of the PCR primers employed it should be understood that the
bacterial mRNA preparation need not be free of mammalian RNA. This allows the
investigator to carry out a simple and quick RNA preparation from infected tissue to
obtain bacterial mRNA species which are very short lived in the bacterium (in the order of 2
minute half-lives). Optimally the bacterial mRNA is prepared from infected murine lung
tissue by mechanical disruption in the presence of TRIzole (GIBCO-BRL) for very short
periods of time, subsequent processing according to the manufacturers of TRIzole reagent
and DNAase treatment to remove contaminating DNA. Preferably the process is optimised
by finding those conditions which give a maximum amount of S. pneumoniae 16S
ribosomal RNA as detected by probing Northerns with a suitably labelled sequence specific
oligonucleotide probe. Typically a 5' dye labelled primer is used in each PCR primer pair
in a PCR reaction which is terminated optimally between 8 and 25 cycles. The PCR
products are separated on 6% polyacrylamide gels with detection and quantification using
GeneScanner (manufactured by ABI).

Each of these techniques may have advantages or disadvantages depending on the
particular application. The skilled artisan would choose the approach that is the most
relevant with the particular end use in mind. For example, some genes might be recognised
as essential for infection but in reality are only necessary for the initiation of infection and
so their products would represent relatively unattractive targets for antibacterials developed
to cure established and chronic infections.

Use of these technologies when applied to the ORFs of the present invention
enables identification of bacterial proteins expressed during infection, inhibitors of which
would have utility in anti-bacterial therapy.

Streptococcus pneumoniae, strain 0100993 has been deposited at the National
Collection of Industrial and Marine Bacteria Ltd. (NCIMB), Aberdeen, Scotland under
NCIMB number 40794 on 11 April 1996, and a Streptococcus pneumoniae, strain 0100993
DNA library in E. coli was similarly deposited on 17 April 1996 under NCIMB number
40800.



The nucleotide sequences disclosed herein can be obtained by synthetic chemical
techniques known in the art or can be obtained from Streptococcus pneumoniae, strain
0100993 by probing a DNA preparation with probes constructed from the particular
sequences disclosed herein. Alternatively, oligonucleotides derived from a disclosed
sequence can act as PCR primers in a process of PCR-based cloning of the sequence from a
bacterial genomic source. It is recognised that such sequences will also have utility in
diagnosis of the stage of infection and type of infection the pathogen has attained.

To obtain the polynucleotide encoding the protein using the DNA sequence given
herein, typically a library of clones of chromosomal DNA of Streptococcus pneumoniae,
strain 0100993 in E. coli or some other suitable host is probed with a radiolabeled
oligonucleotide, preferably a 17mer or longer, derived from the partial sequence. Clones
carrying DNA identical to that of the probe can then be distinguished using high stringency
washes. By sequencing the individual clones thus identified with sequencing primers
designed from the original sequence it is then possible to extend the sequence in both
directions to determine the full gene sequence. Conveniently such sequencing is performed
using denatured double stranded DNA prepared from a plasmid clone. Suitable techniques
are described by Maniatis, T., Fritsch, E.F. and Sambrook, J. in MOLECULAR CLONING,
A Laboratory Manual, 2nd edition, 1989, Cold Spring Harbor Laboratory (see: Screening
By Hybridization 1.90 and Sequencing Denatured Double-Stranded DNA Templates
13.70).

A polynucleotide of the present invention may be in the form of RNA or in the
form of DNA, which DNA includes cDNA, genomic DNA, and synthetic DNA. The DNA
may be double-stranded or single-stranded, and if single stranded may be the coding strand
or non-coding (anti-sense) strand. The coding sequence which encodes the polypeptide
may be identical to the coding sequence shown or may be a different coding sequence
which, as a result of the redundancy or degeneracy of the genetic code,
encodes the same polypeptide.
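
As a minimal sketch of what degeneracy of the genetic code means in practice, the following translates two different coding sequences with the standard genetic code and shows that they specify the same polypeptide. The two short sequences are invented for the example and are not sequences of the invention; the function name is likewise hypothetical.

```python
# Illustrative sketch; the two coding sequences below are invented examples.
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(coding_dna):
    """Translate a coding strand in frame, stopping at the first stop codon."""
    protein = []
    seq = coding_dna.upper()
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":  # termination codon
            break
        protein.append(residue)
    return "".join(protein)


first = "ATGAAACTTTAA"   # Met-Lys-Leu-stop
second = "ATGAAGCTGTGA"  # different codons, same residues
print(translate(first), translate(second))
```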

The present invention includes variants of the hereinabove described
polynucleotides which encode fragments, analogues and derivatives of the polypeptide
characterized by the deduced amino acid sequence given herein. The variant of the
polynucleotide may be a naturally occurring allelic variant of the polynucleotide or a
non-naturally occurring variant of the polynucleotide. In addition to the standard A, G, C, T/U
representations for nucleic acid bases, the term "N" is also used. "N" means that any of the
four DNA or RNA bases may appear at such a designated position in the DNA or RNA
sequence, except that N cannot be a base that when taken in combination with adjacent
nucleotide positions, when read in the correct reading frame, would have the effect of
generating a premature termination codon in such reading frame.
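
The restriction on "N" can be illustrated by enumerating, for a codon that contains one or more N positions, those resolutions that do not produce a termination codon. This is a sketch under the assumption of the standard termination codons TAA, TAG and TGA; the function name is hypothetical.

```python
# Illustrative sketch; assumes the standard stop codons TAA, TAG and TGA.
from itertools import product

STOP_CODONS = {"TAA", "TAG", "TGA"}


def allowed_resolutions(codon):
    """Return all in-frame codons obtainable by replacing each N with a
    base, excluding those that would create a premature stop codon."""
    codon = codon.upper().replace("U", "T")
    choices = ["ACGT" if base == "N" else base for base in codon]
    resolved = ("".join(bases) for bases in product(*choices))
    return sorted(c for c in resolved if c not in STOP_CODONS)


print(allowed_resolutions("TAN"))  # N cannot be A or G at this position
print(allowed_resolutions("GCN"))  # any base is acceptable here
```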

Thus, the present invention includes polynucleotides encoding the same
polypeptide characterized by the deduced amino acid sequence given herein as well as
variants of such polynucleotides which variants encode for a fragment, derivative or
analogue of the polypeptide. Such nucleotide variants include deletion variants,
substitution variants and addition or insertion variants.

The polynucleotide may have a coding sequence which is a naturally occurring
allelic variant of the coding sequence characterized by the DNA sequence disclosed herein.
As known in the art, an allelic variant is an alternate form of a polynucleotide sequence
which may have a substitution, deletion or addition of one or more nucleotides, which does
not substantially alter the function of the encoded polypeptide.

The polynucleotide which encodes for the mature polypeptide may include only
the coding sequence for the mature polypeptide or the coding sequence for the mature
polypeptide and additional coding sequence such as a leader or secretory sequence or a
proprotein sequence.

Thus, the term "polynucleotide encoding a polypeptide" encompasses a
polynucleotide which includes only coding sequence for the polypeptide as well as a
polynucleotide which includes additional coding and/or non-coding sequence.

The present invention therefore includes polynucleotides, wherein the coding
sequence for the mature polypeptide may be fused in the same reading frame to a
polynucleotide sequence which aids in expression and secretion of a polypeptide from a
host cell, for example, a leader sequence which functions as a secretory sequence for
controlling transport of a polypeptide from the cell. The polypeptide having a leader
sequence is a preprotein and may have the leader sequence cleaved by the host cell to form
the mature form of the polypeptide. The polynucleotides may also encode for a proprotein
which is the mature protein plus additional 5' amino acid residues. A mature protein having
a prosequence is a proprotein and may be an inactive form of the protein. Once the
prosequence is cleaved an active mature protein remains.

Thus, for example, the polynucleotide of the present invention may code for a
mature protein, or for a protein having a prosequence or for a protein having both a
prosequence and a presequence (leader sequence). Further, the amino acid sequences
provided herein show a methionine residue at the NH2-terminus. It is appreciated, however,
that during post-translational modification of the peptide, this residue may be deleted.
Accordingly, this invention contemplates the use of both the methionine-containing and the
methionineless amino terminal variants of each protein disclosed herein.

The polynucleotides of the present invention may also have the coding sequence
fused in frame to a marker sequence at either the 5' or 3' terminus of the gene which allows
for purification of the polypeptide of the present invention. The marker sequence may be a
hexa-histidine tag supplied by the pQE series of vectors (supplied commercially by Qiagen
Inc.) to provide for purification of the polypeptide fused to the marker in the case of a
bacterial host.

The present invention further relates to polynucleotides which hybridize to the
hereinabove-described sequences if there is at least 50% and preferably at least 70% identity
between the sequences. The present invention particularly relates to polynucleotides,
particularly Streptococcal polynucleotides, which hybridize under stringent conditions to
the hereinabove-described polynucleotides. As herein used, the term "stringent conditions"
means hybridization will occur only if there is at least 95% and preferably at least 97%
identity between the sequences. The polynucleotides which hybridize to the hereinabove
described polynucleotides in a preferred embodiment encode polypeptides which retain
substantially the same biological function or activity as the polypeptide characterised by the
deduced amino acid sequence given herein. An example of stringent hybridization
conditions is overnight incubation at 42°C in a solution comprising: 50% formamide, 5x
SSC (150mM NaCl, 15mM trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x
Denhardt's solution, 10% dextran sulfate, and 20 micrograms/ml denatured, sheared salmon
sperm DNA, followed by washing the hybridization support in 0.1x SSC at about 65°C.
Hybridization and wash conditions are well known and exemplified in Sambrook, et al.,
Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y.,
(1989), particularly Chapter 11 therein.

The invention also provides a polynucleotide consisting essentially of a 
polynucleotide sequence obtainable by screening an appropriate library containing the 
complete gene for a polynucleotide sequence selected from the group consisting of the 
polynucleotides of the Sequence Listing under stringent hybridization conditions with a 
probe having the sequence of said polynucleotide sequence set forth in said polynucleotide 
of the Sequence Listing or a fragment thereof; and isolating said DNA sequence. 
Fragments useful for obtaining such a polynucleotide include, for example, probes and 
primers described elsewhere herein. 

"Identity," as known in the art and used herein, is a relationship between two or more
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing
the sequences. In the art, "identity" also means the degree of sequence relatedness between
polypeptide or polynucleotide sequences, as the case may be, as determined by the match
between strings of such sequences. "Identity" and "similarity" can be readily calculated by
known methods, including but not limited to those described in (Computational Molecular
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing:
Informatics and Genome Projects, Smith, D.W., ed., Academic Press, New York, 1993;
Computer Analysis of Sequence Data, Part I, Griffin, A.M., and Griffin, H.G., eds.,
Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G.,
Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds.,
M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied
Math., 48: 1073 (1988)). Preferred methods to determine identity are designed to give the
largest match between the sequences tested. Methods to determine identity and similarity
are codified in publicly available computer programs. Preferred computer program methods
to determine identity and similarity between two sequences include, but are not limited to,
the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (1984)),
BLASTP, BLASTN, and FASTA (Altschul, S.F. et al., J. Molec. Biol. 215: 403-410 (1990)).
The BLASTX program is publicly available from NCBI and other sources (BLAST
Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al., J.
Mol. Biol. 215: 403-410 (1990)).

As an illustration, by a polynucleotide having a nucleotide sequence having at least,
for example, 95% "identity" to a reference nucleotide sequence selected from the group
consisting of the polynucleotides of the Sequence Listing it is intended that the nucleotide
sequence of the polynucleotide is identical to the reference sequence except that the
polynucleotide sequence may include up to five point mutations per each 100 nucleotides of
the reference nucleotide sequence. In other words, to obtain a polynucleotide having a
nucleotide sequence at least 95% identical to a reference nucleotide sequence, up to 5% of
the nucleotides in the reference sequence may be deleted or substituted with another
nucleotide, or a number of nucleotides up to 5% of the total nucleotides in the reference
sequence may be inserted into the reference sequence. These mutations of the reference
sequence may occur at the 5' or 3' terminal positions of the reference nucleotide sequence
or anywhere between those terminal positions, interspersed either individually among
nucleotides in the reference sequence or in one or more contiguous groups within the
reference sequence.
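
The arithmetic of this definition can be sketched as follows: the number of permitted alterations is the stated percentage of the reference length, and a candidate satisfies the threshold if its substitutions, deletions and insertions relative to the reference do not exceed that number. The sketch assumes the two sequences have already been aligned, with "-" marking a gap; it is not an implementation of the GCG, BLAST or FASTA programs, and all function names are hypothetical.

```python
# Illustrative sketch; assumes a pre-computed pairwise alignment with "-" gaps.
def max_alterations(reference_length, min_identity_percent):
    """E.g. 95% identity over 100 nucleotides permits up to 5 alterations."""
    return reference_length * (100 - min_identity_percent) // 100


def count_alterations(aligned_reference, aligned_candidate):
    """Count substitutions, deletions and insertions column by column."""
    if len(aligned_reference) != len(aligned_candidate):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for r, c in zip(aligned_reference, aligned_candidate) if r != c)


def meets_identity(aligned_reference, aligned_candidate, min_identity_percent=95):
    """True if the candidate differs from the reference by no more than the
    number of alterations permitted for the reference length."""
    reference_length = len(aligned_reference.replace("-", ""))
    allowed = max_alterations(reference_length, min_identity_percent)
    return count_alterations(aligned_reference, aligned_candidate) <= allowed


print(max_alterations(300, 95))
print(count_alterations("ACGTACGT", "ACGAAC-T"))
```

The same counting applies to amino acid sequences, with residues in place of nucleotides.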

Similarly, by a polypeptide having an amino acid sequence having at least,
for example, 95% identity to a reference amino acid sequence selected from the
group consisting of the amino acid sequences of the Sequence Listing it is intended that the
amino acid sequence of the polypeptide is identical to the reference sequence except
that the polypeptide sequence may include up to five amino acid alterations per
each 100 amino acids of the reference amino acid sequence. In other words, to obtain a
polypeptide having an amino acid sequence at least 95% identical to a reference
amino acid sequence, up to 5% of the amino acid residues in the reference sequence
may be deleted or substituted with another amino acid, or a number of amino acids
up to 5% of the total amino acid residues in the reference sequence may be inserted
into the reference sequence. These alterations of the reference sequence may occur
at the amino or carboxy terminal positions of the reference amino acid sequence or
anywhere between those terminal positions, interspersed either individually among
residues in the reference sequence or in one or more contiguous groups within the
reference sequence.

The deposits referred to herein will be maintained under the terms of the Budapest
Treaty on the International Recognition of the Deposit of Micro-organisms for purposes of
Patent Procedure. These deposits are provided merely as a convenience to those of skill in
the art and are not an admission that a deposit is required under 35 U.S.C. §112. The
sequence of the polynucleotides contained in the deposited material, as well as the amino
acid sequence of the polypeptides encoded thereby, are incorporated herein by reference and
are controlling in the event of any conflict with any description of sequences herein. A
license may be required to make, use or sell the deposited material, and no such license is
hereby granted.

The terms "fragment," "derivative" and "analogue" when referring to the
polypeptide characterized by the deduced amino acid sequence herein, mean a polypeptide
which retains essentially the same biological function or activity as such polypeptide.
Thus, an analogue includes a proprotein which can be activated by cleavage of the
proprotein portion to produce an active mature polypeptide.

The polypeptide of the present invention may be a recombinant polypeptide, a 
natural polypeptide or a synthetic polypeptide, preferably a recombinant polypeptide.

The fragment, derivative or analogue of the polypeptide characterized by the
deduced amino acid sequence herein may be (i) one in which one or more of the amino acid
residues are substituted with a conserved or non-conserved amino acid residue (preferably a
conserved amino acid residue) and such substituted amino acid residue may or may not be
one encoded by the genetic code, or (ii) one in which one or more of the amino acid
residues includes a substituent group, or (iii) one in which the polypeptide is fused with
another compound, such as a compound to increase the half-life of the polypeptide (for
example, polyethylene glycol), or (iv) one in which additional amino acids are fused to
the polypeptide, such as a leader or secretory sequence or a sequence which is employed for
purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and
analogues are deemed to be within the scope of those skilled in the art from the teachings
herein.

The polypeptides and polynucleotides of the present invention are preferably 
provided in an isolated form, and preferably are purified to homogeneity.

The term "isolated" means that the material is removed from its original
environment (e.g., the natural environment if it is naturally occurring). For example, a
naturally-occurring polynucleotide or polypeptide present in a living animal is not isolated,
but the same polynucleotide or polypeptide, separated from some or all of the coexisting
materials in the natural system, is isolated. Such polynucleotides could be part of a vector
and/or such polynucleotides or polypeptides could be part of a composition, and still be
isolated in that such vector or composition is not part of its natural environment.

The present invention also relates to vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the invention 
and the production of polypeptides of the invention by recombinant techniques.

In accordance with yet a further aspect of the present invention, there is therefore
provided a process for producing the polypeptide of the invention by recombinant
techniques by expressing a polynucleotide encoding said polypeptide in a host and
recovering the expressed product. Alternatively, the polypeptides of the invention can be
synthetically produced by conventional peptide synthesizers.

Host cells are genetically engineered (transduced or transformed or transfected)
with the vectors of this invention which may be, for example, a cloning vector or an
expression vector. The vector may be, for example, in the form of a plasmid, a cosmid, a
phage, etc. The engineered host cells can be cultured in conventional nutrient media
modified as appropriate for activating promoters, selecting transformants or amplifying the
genes. The culture conditions, such as temperature, pH and the like, are those previously
used with the host cell selected for expression, and will be apparent to the ordinarily skilled
artisan.

Suitable expression vectors include chromosomal, nonchromosomal and synthetic 
DNA sequences, e.g., bacterial plasmids; phage DNA; baculovirus; yeast plasmids; vectors
derived from combinations of plasmids and phage DNA. However, any other vector may 
be used as long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into an appropriate restriction
endonuclease site(s) by procedures known in the art.

The DNA sequence in the expression vector is operatively linked to an appropriate
expression control sequence(s) (promoter) to direct mRNA synthesis. As representative
examples of such promoters, there may be mentioned: LTR or SV40 promoter, the E. coli
lac or trp, the phage lambda promoter and other promoters known to control expression
of genes in eukaryotic or prokaryotic cells or their viruses. The expression vector also
contains a ribosome binding site for translation initiation and a transcription terminator. The
vector may also include appropriate sequences for amplifying expression.

In addition, the expression vectors preferably contain one or more selectable marker 
genes to provide a phenotypic trait for selection of transformed host cells such as
dihydrofolate reductase or neomycin resistance for eukaryotic cell culture, or such as
tetracycline or ampicillin resistance in E. coli. 

The gene can be placed under the control of a promoter, ribosome binding site (for
bacterial expression) and, optionally, an operator (collectively referred to herein as "control"
elements), so that the DNA sequence encoding the desired protein is transcribed into RNA
in the host cell transformed by a vector containing this expression construction. The coding
sequence may or may not contain a signal peptide or leader sequence. The polypeptides of
the present invention can be expressed using, for example, the E. coli tac promoter or the
protein A gene (spa) promoter and signal sequence. Leader sequences can be removed by
the bacterial host in post-translational processing. See, e.g., U.S. Patent Nos. 4,431,739;
4,425,437; 4,338,397. Promoter regions can be selected from any desired gene using CAT
(chloramphenicol acetyltransferase) vectors or other vectors with selectable markers. Two
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters
include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp. Eukaryotic promoters include CMV
immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and
mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the
level of ordinary skill in the art.

In addition to control sequences, it may be desirable to add regulatory sequences 
which allow for regulation of the expression of the protein sequences relative to the growth 
of the host cell. Regulatory sequences are known to those of skill in the art, and examples
include those which cause the expression of a gene to be turned on or off in response to a 
chemical or physical stimulus, including the presence of a regulatory compound. Other 
types of regulatory elements may also be present in the vector, for example, enhancer 
sequences. 

An expression vector is constructed so that the particular coding sequence is located
in the vector with the appropriate regulatory sequences, the positioning and orientation of
the coding sequence with respect to the control sequences being such that the coding
sequence is transcribed under the "control" of the control sequences (i.e., RNA polymerase
which binds to the DNA molecule at the control sequences transcribes the coding
sequence). Modification of the coding sequences may be desirable to achieve this end. For
example, in some cases it may be necessary to modify the sequence so that it may be
attached to the control sequences with the appropriate orientation; i.e., to maintain the
reading frame. The control sequences and other regulatory sequences may be ligated to the
coding sequence prior to insertion into a vector, such as the cloning vectors described
above. Alternatively, the coding sequence can be cloned directly into an expression vector
which already contains the control sequences and an appropriate restriction site.
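
The requirement to maintain the reading frame can be checked mechanically before a coding
sequence is cloned. The following is a minimal sketch, not a procedure taken from the text:
the function names are hypothetical, the standard stop codons (TAA, TAG, TGA) are assumed, and
ATG is assumed to be the only start codon, which is a simplification for bacterial genes.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def split_codons(seq: str) -> list:
    """Split a sequence into complete triplets, ignoring any trailing bases."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def frame_problems(coding_seq: str) -> list:
    """Return a list of reading-frame problems; an empty list means none found."""
    seq = coding_seq.upper()
    triplets = split_codons(seq)
    problems = []
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of three")
    if not triplets or triplets[0] != "ATG":
        problems.append("does not start with ATG")
    if any(codon in STOP_CODONS for codon in triplets[:-1]):
        problems.append("internal stop codon")
    if not triplets or triplets[-1] not in STOP_CODONS:
        problems.append("no terminal stop codon")
    return problems


print(frame_problems("ATGGCTTAA"))     # in-frame toy sequence
print(frame_problems("ATGGCTTA"))      # one base lost, frame broken
print(frame_problems("ATGTAAGCTTGA"))  # premature stop
```

A check of this kind only confirms that the coding sequence itself is intact; whether it is
correctly positioned relative to the control sequences still depends on the junction created
in the vector.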

Generally, recombinant expression vectors will include origins of replication and
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a
highly-expressed gene to direct transcription of a downstream structural sequence. The
heterologous structural sequence is assembled in appropriate phase with translation
initiation and termination sequences, and preferably, a leader sequence capable of directing
secretion of translated protein into the periplasmic space or extracellular medium.
Optionally, the heterologous sequence can encode a fusion protein including an N-terminal
identification peptide imparting desired characteristics, e.g., stabilization or simplified
purification of expressed recombinant product.

The vector containing the appropriate DNA sequence as hereinabove described, as 
well as an appropriate promoter or control sequence, may be employed to transform an 
appropriate host to permit the host to express the protein.

More particularly, the present invention also includes recombinant constructs
comprising one or more of the sequences as broadly described above. The constructs
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention
has been inserted, in a forward or reverse orientation. In a preferred aspect of this
embodiment, the construct further comprises regulatory sequences, including, for example,
a promoter, operably linked to the sequence. Large numbers of suitable vectors and
promoters are known to those of skill in the art, and are commercially available. The
following vectors are provided by way of example. Bacterial: pET-3 vectors (Stratagene),
pQE70, pQE60, pQE-9 (Qiagen), pBS, pD10, phagescript, psiX174, pBluescript SK, pBSKS,
pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3,
pDR540, pRIT5 (Pharmacia). Eukaryotic: pBlueBacIII (Invitrogen), pWLNEO, pSV2CAT,
pOG44, pXT1, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However,
any other plasmid or vector may be used as long as it is replicable and viable in the
host.

Examples of recombinant DNA vectors for cloning and host cells which they can
transform include the bacteriophage λ (E. coli), pBR322 (E. coli), pACYC177 (E. coli),
pKT230 (gram-negative bacteria), pGV1106 (gram-negative bacteria), pLAFR1
(gram-negative bacteria), pME290 (non-E. coli gram-negative bacteria), pHV14 (E. coli and
Bacillus subtilis), pBD9 (Bacillus), pIJ61 (Streptomyces), pUC6 (Streptomyces), YIp5
(Saccharomyces), a baculovirus insect cell system, YCp19 (Saccharomyces). See,
generally, "DNA Cloning": Vols. I & II, Glover et al. ed., IRL Press, Oxford (1985) (1987)
and T. Maniatis et al., "Molecular Cloning," Cold Spring Harbor Laboratory (1982).

In some cases, it may be desirable to add sequences which cause the secretion of the 
polypeptide from the host organism, with subsequent cleavage of the secretory signal. 

Polypeptides can be expressed in host cells under the control of appropriate
promoters. Cell-free translation systems can also be employed to produce such proteins
using RNAs derived from the DNA constructs of the present invention. Appropriate
cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described
by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, Cold
Spring Harbor, N.Y., (1989), the disclosure of which is hereby incorporated by reference.

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g., 
temperature shift or chemical induction) and cells are cultured for an additional period.

Cells are typically harvested by centrifugation, disrupted by physical or chemical
means, and the resulting crude extract retained for further purification.

Microbial cells employed in expression of proteins can be disrupted by any
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use
of cell lysing agents; such methods are well known to those skilled in the art.

Depending on the expression system and host selected, the polypeptide of the
present invention may be produced by growing host cells transformed by an expression
vector described above under conditions whereby the polypeptide of interest is expressed.
The polypeptide is then isolated from the host cells and purified. If the expression system
secretes the polypeptide into growth media, the polypeptide can be purified directly from
the media. If the polypeptide is not secreted, it is isolated from cell lysates or recovered
from the cell membrane fraction. Where the polypeptide is localized to the cell surface,
whole cells or isolated membranes can be used as an assayable source of the desired gene
product. Polypeptide expressed in bacterial hosts such as E. coli may require isolation from
inclusion bodies and refolding. Where the mature protein has a very hydrophobic region
which leads to an insoluble product of overexpression, it may be desirable to express a
truncated protein in which the hydrophobic region has been deleted. The selection of the
appropriate growth conditions and recovery methods is within the skill of the art.

The polypeptide can be recovered and purified from recombinant cell cultures by
methods including ammonium sulphate or ethanol precipitation, acid extraction, anion or
cation exchange chromatography, phosphocellulose chromatography, hydrophobic
interaction chromatography, affinity chromatography, hydroxylapatite chromatography and
lectin chromatography. Protein refolding steps can be used, as necessary, in completing
configuration of the mature protein. Finally, high performance liquid chromatography
(HPLC) can be employed for final purification steps.

Depending upon the host employed in a recombinant production procedure, the 
polypeptides of the present invention may be glycosylated or may be non-glycosylated. 
Polypeptides of the invention may also include an initial methionine amino acid residue. 

A "replicon" is any genetic element (e.g., plasmid, chromosome, virus) that 
functions as an autonomous unit of DNA replication in vivo; i.e., capable of replication 
under its own control.

A "vector" is a replicon, such as a plasmid, phage, or cosmid, to which another
DNA segment may be attached so as to bring about the replication of the attached segment.

A "double-stranded DNA molecule" refers to the polymeric form of
deoxyribonucleotides (bases adenine, guanine, thymine, or cytosine) in a double-stranded
helix, both relaxed and supercoiled. This term refers only to the primary and secondary
structure of the molecule, and does not limit it to any particular tertiary forms. Thus, this
term includes double-stranded DNA found, inter alia, in linear DNA molecules (e.g.,
restriction fragments), viruses, plasmids, and chromosomes. In discussing the structure of
particular double-stranded DNA molecules, sequences may be described herein according to
the normal convention of giving only the sequence in the 5' to 3' direction along the
nontranscribed strand of DNA (i.e., the strand having the sequence homologous to the
mRNA).

A DNA "coding sequence of" or a "nucleotide sequence encoding" a particular
protein, is a DNA sequence which is transcribed and translated into a polypeptide when 
placed under the control of appropriate regulatory sequences. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3' direction) coding 
sequence. For purposes of defining the present invention, the promoter sequence is bounded
at the 3' terminus by a translation start codon (e.g., ATG) of a coding sequence and extends
upstream (5' direction) to include the minimum number of bases or elements necessary to
initiate transcription at levels detectable above background. Within the promoter sequence
will be found a transcription initiation site (conveniently defined by mapping with nuclease
S1), as well as protein binding domains (consensus sequences) responsible for the binding
of RNA polymerase. Eukaryotic promoters will often, but not always, contain "TATA"
boxes and "CAT" boxes. Prokaryotic promoters contain Shine-Dalgarno sequences in
addition to the -10 and -35 consensus sequences. 
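
As an illustration of how the -35 and -10 elements mentioned above might be located in a
sequence, the sketch below scans for exact matches to two hexamers separated by a short
spacer. The hexamers TTGACA and TATAAT and the 16 to 18 base spacer are textbook values for
the E. coli sigma-70 consensus and are assumptions here, not sequences given in the text;
the function name is hypothetical, and real promoters usually match the consensus only
partially, so an exact-match scan is a deliberate simplification.

```python
import re

# Assumed textbook consensus hexamers (E. coli sigma-70); not taken from the text.
MINUS_35 = "TTGACA"
MINUS_10 = "TATAAT"


def find_consensus_promoters(dna: str, min_spacer: int = 16, max_spacer: int = 18) -> list:
    """Return 0-based start positions of -35 hexamers followed by a -10 hexamer.

    A lookahead is used so that overlapping candidates are all reported.
    """
    pattern = re.compile(
        "(?=%s[ACGT]{%d,%d}%s)" % (MINUS_35, min_spacer, max_spacer, MINUS_10)
    )
    return [match.start() for match in pattern.finditer(dna.upper())]


# Toy sequence: 2 flanking bases, -35 hexamer, 17-base spacer, -10 hexamer, start codon.
toy = "CC" + MINUS_35 + "G" * 17 + MINUS_10 + "ATG"
print(find_consensus_promoters(toy))
```

A scoring approach that tolerates mismatches would be closer to practice; the exact-match
version is used only to keep the example short.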

DNA "control sequences" refers collectively to promoter sequences, ribosome 
binding sites, polyadenylation signals, transcription termination sequences, upstream 
regulatory domains, enhancers, and the like, which collectively provide for the expression 
(i.e., the transcription and translation) of a coding sequence in a host cell. 

A control sequence "directs the expression" of a coding sequence in a cell when 
RNA polymerase will bind the promoter sequence and transcribe the coding sequence into 
mRNA, which is then translated into the polypeptide encoded by the coding sequence.

A "host cell" is a cell which has been transformed or transfected, or is capable of
transformation or transfection by an exogenous DNA sequence.

A cell has been "transformed" by exogenous DNA when such exogenous DNA has
been introduced inside the cell membrane. Exogenous DNA may or may not be integrated
(covalently linked) into chromosomal DNA making up the genome of the cell. In
prokaryotes and yeasts, for example, the exogenous DNA may be maintained on an
episomal element, such as a plasmid. With respect to eukaryotic cells, a stably transformed
or transfected cell is one in which the exogenous DNA has become integrated into the
chromosome so that it is inherited by daughter cells through chromosome replication. This
stability is demonstrated by the ability of the eukaryotic cell to establish cell lines or clones
comprised of a population of daughter cells containing the exogenous DNA.

A "clone" is a population of cells derived from a single cell or common ancestor by
mitosis. A "cell line" is a clone of a primary cell that is capable of stable growth in vitro for
many generations.

A "heterologous" region of a DNA construct is an identifiable segment of DNA
within or attached to another DNA molecule that is not found in association with the other
molecule in nature.

This invention is also related to the use of the polynucleotides of the invention as
diagnostic reagents. Detection of a polynucleotide or polypeptide of the invention in a
eukaryote, particularly a mammal, and especially a human, will provide a diagnostic method for
diagnosis of a disease. Eukaryotes (herein also "individual(s)"), particularly mammals, and
especially humans, infected with an organism comprising a polynucleotide of the invention may
be detected at the nucleic acid level by a variety of techniques.

Nucleic acids for diagnosis may be obtained from an infected individual's cells and 
tissues, such as bone, blood, muscle, cartilage, and skin. Genomic DNA may be used directly for 
detection or may be amplified enzymatically by using PCR or another amplification technique prior
to analysis. RNA or cDNA may also be used in the same ways. Using amplification,
characterization of the species and strain of prokaryote present in an individual may be made by
an analysis of the genotype of the prokaryote gene. Deletions and insertions can be detected by a
change in size of the amplified product in comparison to the genotype of a reference sequence. 
Point mutations can be identified by hybridizing amplified DNA to labeled polynucleotide 
sequences of the invention. Perfectly matched sequences can be distinguished from mismatched 
duplexes by RNase digestion or by differences in melting temperatures. DNA sequence 
differences may also be detected by alterations in the electrophoretic mobility of the DNA 
fragments in gels, with or without denaturing agents, or by direct DNA sequencing. See, e.g., 
Myers et al., Science, 230: 1242 (1985). Sequence changes at specific locations also may be
revealed by nuclease protection assays, such as RNase and S1 protection or a chemical cleavage
method. See, e.g., Cotton et al., Proc. Natl. Acad. Sci., USA, 85: 4397-4401 (1985).
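
The size comparison described above, in which deletions and insertions appear as a change in
the length of the amplified product relative to the reference, can be sketched as follows.
This is only an illustration: the function name, the tolerance parameter (standing in for the
sizing resolution of the gel or instrument) and the example lengths are all hypothetical.

```python
def classify_size_change(reference_len: int, amplicon_len: int, tolerance: int = 0) -> str:
    """Compare an observed amplicon length with the expected reference length.

    `tolerance` is the number of base pairs within which two sizes are
    treated as indistinguishable.
    """
    difference = amplicon_len - reference_len
    if abs(difference) <= tolerance:
        return "no size change"
    if difference > 0:
        return "insertion of about %d bp" % difference
    return "deletion of about %d bp" % -difference


# Hypothetical amplicon lengths compared with a 500 bp reference product.
for observed in (500, 512, 470):
    print(observed, classify_size_change(500, observed, tolerance=2))
```

A point mutation leaves the product length unchanged, which is why the text turns to
hybridization, melting behaviour, nuclease protection or sequencing for that case.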

Cells carrying mutations or polymorphisms in the gene of the invention may also be
detected at the DNA level by a variety of techniques, to allow for serotyping, for example. For
example, RT-PCR can be used to detect mutations. It is particularly preferred to use RT-PCR
in conjunction with automated detection systems, such as, for example, GeneScan. RNA or
cDNA may also be used for the same purpose, with PCR or RT-PCR. As an example, PCR primers
complementary to a nucleic acid encoding a polypeptide of the invention can be used to identify
and analyze mutations. These primers may be used for, among other things, amplifying DNA
isolated from a sample derived from an individual. The primers may be used to amplify the gene
isolated from an infected individual such that the gene may then be subject to various techniques
for elucidation of the DNA sequence. In this way, mutations in the DNA sequence may be
detected and used to diagnose infection and to serotype and/or classify the infectious agent.

The invention further provides a process for diagnosing disease, preferably bacterial
infections, more preferably infections by Streptococcus pneumoniae, comprising determining
from a sample derived from an individual an increased level of expression of a polynucleotide
having a sequence set forth in the Sequence Listing or a sequence of the invention.
Increased or decreased expression of a polynucleotide of the invention can be measured
using any one of the methods well known in the art for the quantitation of polynucleotides,
such as, for example, amplification, PCR, RT-PCR, RNase protection, Northern blotting
and other hybridization methods.

In addition, a diagnostic assay in accordance with the invention for detecting
over-expression of a polypeptide of the invention compared to normal control tissue samples may be
used to detect the presence of an infection, for example. Assay techniques that can be used to
determine levels of a polypeptide of the invention in a sample derived from a host are
well-known to those of skill in the art. Such assay methods include radioimmunoassays,
competitive-binding assays, Western Blot analysis and ELISA assays.

In accordance with yet a further aspect of the present invention, there is provided
the use of a polypeptide of the invention for therapeutic or prophylactic purposes, for
example, as an antibacterial agent or a vaccine.

In accordance with another aspect of the present invention, there is provided the use 
of a polynucleotide of the invention for therapeutic or prophylactic purposes, in particular 
genetic immunisation.

In accordance with yet another aspect of the present invention, there are provided
inhibitors to such polypeptides, useful as antibacterial agents. In particular, there are 
provided antibodies against such polypeptides. 

Polypeptides of the invention may also be used to assess the binding of small molecule
substrates and ligands in, for example, cells, cell-free preparations, chemical libraries, and natural
product mixtures. These substrates and ligands may be natural substrates and ligands or may be
structural or functional mimetics. See, e.g., Coligan et al., Current Protocols in Immunology
1(2): Chapter 5 (1991).

The invention also provides a method of screening compounds to identify those which
enhance (agonist) or block (antagonist) the action of polypeptides or polynucleotides of the
invention, particularly those compounds that are bacteriostatic and/or bactericidal. The method
of screening may involve high-throughput techniques. For example, to screen for agonists or
antagonists, a synthetic reaction mix, a cellular compartment, such as a membrane, cell envelope
or cell wall, or a preparation of any thereof, comprising a polypeptide of the invention and a
labeled substrate or ligand of such polypeptide is incubated in the absence or the presence of a
candidate molecule that may be an agonist or antagonist of a polypeptide of the invention. The
ability of the candidate molecule to agonize or antagonize a polypeptide of the invention is
reflected in decreased binding of the labeled ligand or decreased production of product from such
substrate. Molecules that bind gratuitously, i.e., without inducing the effects of a polypeptide of
the invention, are most likely to be good antagonists. Molecules that bind well and increase the
rate of product production from substrate are agonists. Detection of the rate or level of production
of product from substrate may be enhanced by using a reporter system. Reporter systems that
may be useful in this regard include but are not limited to colorimetric labeled substrate
converted into product, a reporter gene that is responsive to changes in polynucleotide or
polypeptide activity, and binding assays known in the art.
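
The comparison of signal in the absence and in the presence of a candidate molecule can be
reduced to a simple calculation. The sketch below is illustrative only: the function names,
the 50% calling threshold and the example readings are hypothetical and are not values given
in the text. A positive result means less product or bound label than in the control, and a
negative result means more.

```python
def percent_inhibition(control_signal: float, treated_signal: float) -> float:
    """Percent decrease in signal caused by the candidate, relative to the control."""
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return 100.0 * (control_signal - treated_signal) / control_signal


def classify_candidate(control_signal: float, treated_signal: float,
                       threshold: float = 50.0) -> str:
    """Call a candidate from the change in signal; the threshold is arbitrary."""
    change = percent_inhibition(control_signal, treated_signal)
    if change >= threshold:
        return "candidate antagonist"
    if change <= -threshold:
        return "candidate agonist"
    return "no call"


# Hypothetical readings: control without candidate, then three candidates.
control = 1000.0
for reading in (250.0, 1500.0, 900.0):
    print(reading, percent_inhibition(control, reading), classify_candidate(control, reading))
```

In a high-throughput setting the same calculation would be applied per well, with the
threshold chosen from the variability of the control wells.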

Another example of an assay for antagonists of a polypeptide of the invention is a
competitive assay that combines such polypeptide and a potential antagonist with
polypeptide-binding molecules, recombinant polypeptide-binding molecules, natural substrates or
ligands, or substrate or ligand mimetics, under appropriate conditions for a competitive
inhibition assay. Polypeptides of the invention can be labeled, such as by radioactivity or a
colorimetric compound, such that the number of such polypeptide molecules bound to a binding
molecule or converted to product can be determined accurately to assess the effectiveness of the
potential antagonist.

Potential antagonists include small organic molecules, peptides, polypeptides and
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or
extinguish its activity. Potential antagonists also may be small organic molecules, a peptide, a
polypeptide such as a closely related protein or antibody that binds the same sites on a binding
molecule, without inducing an activity of a polypeptide of the invention, thereby preventing the
action of such polypeptide by excluding it from binding.

Potential antagonists include a small molecule that binds to and occupies the binding
site of the polypeptide thereby preventing binding to cellular binding molecules, such that
normal biological activity is prevented. Examples of small molecules include but are not limited
to small organic molecules, peptides or peptide-like molecules. Other potential antagonists
include antisense molecules (see Okano, J. Neurochem. 56: 560 (1991);
OLIGODEOXYNUCLEOTIDES AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC
Press, Boca Raton, FL (1988), for a description of these molecules). Preferred potential
antagonists include compounds related to and variants of the polypeptides of the invention.

Each of the DNA sequences provided herein may be used in the discovery and
development of antibacterial compounds. The encoded protein, upon expression, can be
used as a target for the screening of antibacterial drugs. Additionally, the DNA sequences
encoding the amino terminal regions of the encoded protein or Shine-Dalgarno or other
translation facilitating sequences of the respective mRNA can be used to construct antisense
sequences to control the expression of the coding sequence of interest.
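
Constructing an antisense sequence against the amino terminal coding region or a
translation facilitating sequence amounts to taking the reverse complement of the chosen
window of the coding strand. The sketch below shows that step only; the function names, the
toy sequence and the window coordinates are hypothetical, and no claim is made about which
window would be effective in practice.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return dna.upper().translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding_strand: str, start: int, length: int) -> str:
    """Antisense DNA oligo (5' to 3') against a window of the coding strand.

    `start` is a 0-based offset into the coding (mRNA-like) strand.
    """
    if start < 0 or length <= 0:
        raise ValueError("start must be non-negative and length positive")
    target = coding_strand[start:start + length]
    if len(target) != length:
        raise ValueError("target window runs past the end of the sequence")
    return reverse_complement(target)


# Toy coding strand: a Shine-Dalgarno-like stretch, a short spacer, then a start codon.
toy_coding_strand = "AGGAGGTTTTATGAAACGT"
print(antisense_oligo(toy_coding_strand, 0, 13))
```

Because the oligo is the reverse complement of the coding strand, it is complementary to the
corresponding stretch of the mRNA and can pair with it.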

The invention also provides the use of the polypeptide, polynucleotide or inhibitor
of the invention to interfere with the initial physical interaction between a pathogen and
mammalian host responsible for sequelae of infection. In particular the molecules of the
invention may be used: in the prevention of adhesion of bacteria, in particular gram positive
bacteria, to mammalian extracellular matrix proteins on in-dwelling devices or to
extracellular matrix proteins in wounds; to block protein-mediated mammalian cell invasion
by, for example, initiating phosphorylation of mammalian tyrosine kinases (Rosenshine et
al., Infect. Immun. 60:2211 (1992)); to block bacterial adhesion between mammalian
extracellular matrix proteins and bacterial proteins that mediate tissue damage; and to block
the normal progression of pathogenesis in infections initiated other than by the implantation
of in-dwelling devices or by other surgical techniques.

The antagonists and agonists of the invention may be employed, for instance, to inhibit 
and treat otitis media, conjunctivitis, pneumonia, bacteremia, meningitis, sinusitis, pleural
empyema and endocarditis, and most particularly meningitis, such as for example infection of
cerebrospinal fluid.

Another aspect of the invention is a pharmaceutical composition comprising the 
above polypeptide, polynucleotide or inhibitor of the invention and a pharmaceutically 
acceptable carrier.

In a particular aspect the invention provides the use of an inhibitor of the invention 
as an antibacterial agent. 

The invention further relates to the manufacture of a medicament for such uses. 

The polypeptide may be used as an antigen for vaccination of a host to produce 
specific antibodies which have anti-bacterial action. This invention also contemplates the
use of the DNA encoding the antigen as a component in a DNA vaccine as discussed more 
fully below. 

The polypeptides or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or monoclonal 
antibodies. The term antibodies also includes chimeric, single chain, and humanized
antibodies, as well as Fab fragments, or the product of an Fab expression library. Various
procedures known in the art may be used for the production of such antibodies and 
fragments. 

Antibodies generated against the polypeptides of the present invention can be
obtained by direct injection of the polypeptides into an animal or by administering the
polypeptides to an animal, preferably a nonhuman. The antibody so obtained will then bind
the polypeptides themselves. In this manner, even a sequence encoding only a fragment of the
polypeptides can be used to generate antibodies binding the whole native polypeptides.
Such antibodies can then be used to isolate the polypeptide from tissue expressing that
polypeptide.

Polypeptide derivatives include antigenically or immunologically equivalent 
derivatives which form a particular aspect of this invention. 

The term 'antigenically equivalent derivative' as used herein encompasses a 
polypeptide or its equivalent which will be specifically recognised by certain antibodies 
which, when raised to the protein or polypeptide according to the present invention, 
interfere with the interaction between pathogen and mammalian host. 

The term 'immunologically equivalent derivative' as used herein encompasses a 
peptide or its equivalent which, when used in a suitable formulation to raise antibodies in a 
vertebrate, yields antibodies that act to interfere with the interaction between pathogen and 
mammalian host. 

In particular derivatives which are slightly longer or slightly shorter than the native 
protein or polypeptide fragment of the present invention may be used. In addition, 
polypeptides in which one or more of the amino acid residues are modified may be used. 
Such peptides may, for example, be prepared by substitution, addition, or rearrangement of 
amino acids or by chemical modification thereof. All such substitutions and modifications 
are generally well known to those skilled in the art of peptide chemistry. 

The polypeptide, such as an antigenically or immunologically equivalent derivative 
or a fusion protein thereof, is used as an antigen to immunize a mouse or other animal such 
as a rat or chicken. The fusion protein may provide stability to the polypeptide. The 
antigen may be associated, for example by conjugation, with an immunogenic carrier 
protein, for example bovine serum albumin (BSA) or keyhole limpet haemocyanin (KLH). 
Alternatively a multiple antigenic peptide comprising multiple copies of the protein or 
polypeptide, or an antigenically or immunologically equivalent polypeptide thereof, may be 
sufficiently antigenic to improve immunogenicity so as to obviate the use of a carrier. 

For preparation of monoclonal antibodies, any technique which provides antibodies 
produced by continuous cell line cultures can be used. Examples include the hybridoma 
technique (Kohler and Milstein, Nature 256:495-497 (1975)), the trioma technique, the 
human B-cell hybridoma technique (Kozbor et al., Immunology Today 4:72 (1983)), and the 
EBV-hybridoma technique to produce human monoclonal antibodies (Cole et al., 1985, in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. Patent 
4,946,778) can be adapted to produce single chain antibodies to immunogenic polypeptide 
products of this invention. 

Using the procedure of Kohler and Milstein (supra, (1975)), antibody-containing 
cells from the immunised mammal are fused with myeloma cells to create hybridoma cells 
secreting monoclonal antibodies. 

The hybridomas are screened to select a cell line with high binding affinity and 
favorable cross reaction with other Streptococcal species using one or more of the original 
polypeptide and/or the fusion protein. The selected cell line is cultured to obtain the desired 
Mab. 

Hybridoma cell lines secreting the monoclonal antibody are another aspect of this 
invention. 

Alternatively phage display technology could be utilised to select antibody genes 
with binding activities towards the polypeptide either from repertoires of PCR amplified 
v-genes of lymphocytes from humans screened for possessing anti-Fbp or from naive libraries 
(McCafferty, J. et al., Nature 348:552-554 (1990), and Marks, J. et al., Biotechnology 
10:779-783 (1992)). The affinity of these antibodies can also be improved by chain 
shuffling (Clackson, T. et al., Nature 352:624-628 (1991)). 

The antibody should be screened again for high affinity to the polypeptide and/or 
fusion protein. 

As mentioned above, a fragment of the final antibody may be prepared. 

The antibody may be either intact antibody of Mr approx 150,000 or a derivative of 
it, for example a Fab fragment or a Fv fragment as described in Skerra, A. and Pluckthun, 
A., Science 240:1038-1040 (1988). If two antigen binding domains are present each 
domain may be directed against a different epitope - termed 'bispecific' antibodies. 

The antibody of the invention may be prepared by conventional means, for example 
by established monoclonal antibody technology (Kohler, G. and Milstein, C., supra, (1975)) 
or using recombinant means e.g. combinatorial libraries, for example as described in Huse, 
W.D. et al., Science 246:1275-1281 (1989). 

Preferably the antibody is prepared by expression of a DNA polymer encoding said 
antibody in an appropriate expression system such as described above for the expression of 
polypeptides of the invention. The choice of vector for the expression system will be 
determined in part by the host, which may be a prokaryotic cell, such as E. coli (preferably 
strain B) or Streptomyces sp. or a eukaryotic cell, such as a mouse C127, mouse myeloma, 
human HeLa, Chinese hamster ovary, filamentous or unicellular fungi or insect cell. The 
host may also be a transgenic animal or a transgenic plant (for example, as described in 
Hiatt, A. et al., Nature 342:76-78 (1989)). Suitable vectors include plasmids, bacteriophages, 
cosmids and recombinant viruses, derived from, for example, baculoviruses and vaccinia. 

The Fab fragment may also be prepared from its parent monoclonal antibody by 
enzyme treatment, for example using papain to cleave the Fab portion from the Fc portion. 

Preferably the antibody or derivative thereof is modified to make it less 
immunogenic in the patient. For example, if the patient is human the antibody may most 
preferably be 'humanised', where the complementarity determining region(s) of the 
hybridoma-derived antibody has been transplanted into a human monoclonal antibody, for 
example as described in Jones, P. et al., Nature 321:522-525 (1986), or Tempest et al., 
Biotechnology 9:266-273 (1991). 

The modification need not be restricted to one of 'humanisation'; other primate 
sequences (for example Newman, R. et al., Biotechnology 10:1455-1460 (1992)) may also 
be used. 

The humanised monoclonal antibody, or its fragment having binding activity, forms 
a particular aspect of this invention. 

This invention provides a method of screening drugs to identify those which 
interfere with the proteins selected as targets herein, which method comprises measuring the 
interference of the activity of the protein by a test drug. For example if the protein selected 
has a catalytic activity, after suitable purification and formulation the activity of the enzyme 
can be followed by its ability to convert its natural substrates. By incorporating different 
chemically synthesised test compounds or natural products into such an assay of enzymatic 
activity one is able to detect those additives which compete with the natural substrate or 
otherwise inhibit enzymatic activity. 

The invention also relates to inhibitors identified thereby. 
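
The screening principle described above, in which enzyme activity is followed with and without a test compound, can be illustrated with a minimal Python sketch. The function names, the 50% threshold and the example rates are hypothetical and are not part of the disclosed method; a real assay would define its own readout and cut-off.

```python
def percent_inhibition(control_rate, test_rate):
    """Percent loss of enzyme activity relative to an uninhibited control.

    Both rates are substrate-conversion rates in the same (arbitrary) units.
    """
    if control_rate <= 0:
        raise ValueError("control_rate must be positive")
    return 100.0 * (1.0 - test_rate / control_rate)


def select_hits(rates, control_rate, threshold=50.0):
    """Return names of test compounds whose inhibition meets the threshold.

    `rates` maps a compound name to the rate measured in its presence.
    The threshold is an illustrative assumption, not a value from the text.
    Hits are returned strongest inhibitor first.
    """
    scored = {name: percent_inhibition(control_rate, rate)
              for name, rate in rates.items()}
    hits = [name for name, value in scored.items() if value >= threshold]
    return sorted(hits, key=lambda name: scored[name], reverse=True)


if __name__ == "__main__":
    measured = {"compound_a": 9.0, "compound_b": 2.0, "compound_c": 5.0}
    print(select_hits(measured, control_rate=10.0))
```

A compound that competes with the natural substrate or otherwise inhibits the enzyme lowers the measured rate, so it scores a higher percent inhibition in this sketch.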

The use of a polynucleotide of the invention in genetic immunisation will 
preferably employ a suitable delivery method such as direct injection of plasmid DNA into 
muscles (Wolff et al., Hum. Mol. Genet. 1:363 (1992); Manthorpe et al., Hum. Gene Ther. 
4:419 (1993)), delivery of DNA complexed with specific protein carriers (Wu et al., J. 
Biol. Chem. 264:16985 (1989)), coprecipitation of DNA with calcium phosphate 
(Benvenisty & Reshef, Proc. Nat'l Acad. Sci. USA 83:9551 (1986)), encapsulation of DNA 
in various forms of liposomes (Kaneda et al., Science 243:375 (1989)), particle 
bombardment (Tang et al., Nature 356:152 (1992); Eisenbraun et al., DNA Cell Biol. 
12:791 (1993)) and in vivo infection using cloned retroviral vectors (Seeger et al., Proc. 
Natl. Acad. Sci. USA 81:5849 (1984)). Suitable promoters for muscle transfection include 
CMV, RSV, SRa, actin, MCK, alpha globin, adenovirus and dihydrofolate reductase. 

In therapy or as a prophylactic, the active agent i.e., the polypeptide, polynucleotide 
or inhibitor of the invention, may be administered to a patient as an injectable composition, 
for example as a sterile aqueous dispersion, preferably isotonic. 

Alternatively the composition may be formulated for topical application, 
for example in the form of ointments, creams, lotions, eye ointments, eye drops, ear drops, 
mouthwash, impregnated dressings and sutures and aerosols, and may contain appropriate 
conventional additives, including, for example, preservatives, solvents to assist drug 
penetration, and emollients in ointments and creams. Such topical formulations may also 
contain compatible conventional carriers, for example cream or ointment bases, and ethanol 
or oleyl alcohol for lotions. Such carriers may constitute from about 1% to about 98% by 
weight of the formulation; more usually they will constitute up to about 80% by weight of 
the formulation. 

For administration to human patients, it is expected that the daily dosage level of 
the active agent will be from 0.01 to 10 mg/kg, typically around 1 mg/kg. The physician in 
any event will determine the actual dosage which will be most suitable for an individual 
patient and will vary with the age, weight and response of the particular patient. The above 
dosages are exemplary of the average case. There can, of course, be individual instances 
where higher or lower dosage ranges are merited, and such are within the scope of this 
invention. 
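
As a purely arithmetic illustration of the per-kilogram range stated above, the following Python sketch converts the 0.01 to 10 mg/kg range (typically about 1 mg/kg) into daily amounts for a given body weight. The function name and the 70 kg example are assumptions made for illustration; as the text states, the actual dosage is determined by the physician.

```python
# Daily dosage range stated in the text, in mg of active agent per kg body weight.
DAILY_MG_PER_KG = {"low": 0.01, "typical": 1.0, "high": 10.0}


def daily_dose_mg(weight_kg):
    """Scale the stated mg/kg range to a body weight (illustrative arithmetic only)."""
    if weight_kg <= 0:
        raise ValueError("weight_kg must be positive")
    return {label: rate * weight_kg for label, rate in DAILY_MG_PER_KG.items()}


if __name__ == "__main__":
    # Hypothetical 70 kg patient.
    for label, amount in daily_dose_mg(70).items():
        print(f"{label}: {amount:.2f} mg/day")
```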

A vaccine composition is conveniently in injectable form. Conventional adjuvants 
may be employed to enhance the immune response. 

A suitable unit dose for vaccination is 0.5-5 ug/kg of antigen, and such dose is 
preferably administered 1-3 times and with an interval of 1-3 weeks. 

Within the indicated dosage range, no adverse toxicological effects are expected 
with the compounds of the invention which would preclude their administration to suitable 
patients. 

In order to facilitate understanding of the following example certain frequently 
occurring methods and/or terms will be described. 

"Plasmids" are designated by a lower case p preceded and/or followed by capital 
letters and/or numbers. The starting plasmids herein are either commercially available, 
publicly available on an unrestricted basis, or can be constructed from available plasmids in 
accord with published procedures. In addition, equivalent plasmids to those described are 
known in the art and will be apparent to the ordinarily skilled artisan. 

"Digestion" of DNA refers to catalytic cleavage of the DNA with a restriction 
enzyme that acts only at certain sequences in the DNA. The various restriction enzymes 
used herein are commercially available and their reaction conditions, cofactors and other 
requirements were used as would be known to the ordinarily skilled artisan. For analytical 
purposes, typically 1 ug of plasmid or DNA fragment is used with about 2 units of enzyme 
in about 20 ul of buffer solution. For the purpose of isolating DNA fragments for plasmid 
construction, typically 5 to 50 ug of DNA are digested with 20 to 250 units of enzyme in a 
larger volume. Appropriate buffers and substrate amounts for particular restriction enzymes 
are specified by the manufacturer. Incubation times of about 1 hour at 37° C are ordinarily 
used, but may vary in accordance with the supplier's instructions. After digestion the 
reaction is electrophoresed directly on a polyacrylamide gel to isolate the desired fragment. 

Size separation of the cleaved fragments is performed using 8 percent 
polyacrylamide gel described by Goeddel, D. et al., (1980) Nucleic Acids Res., 8:4057. 

"Oligonucleotides" refers to either a single stranded polydeoxynucleotide or two 
complementary polydeoxynucleotide strands which may be chemically synthesized. Such 
synthetic oligonucleotides have no 5' phosphate and thus will not ligate to another 
oligonucleotide without adding a phosphate with an ATP in the presence of a kinase. A 
synthetic oligonucleotide will ligate to a fragment that has not been dephosphorylated. 

"Ligation" refers to the process of forming phosphodiester bonds between two 
double stranded nucleic acid fragments (Maniatis, T., et al., supra., p. 146). Unless 
otherwise provided, ligation may be accomplished using known buffers and conditions with 
10 units of T4 DNA ligase ("ligase") per 0.5 ug of approximately equimolar amounts of the 
DNA fragments to be ligated. 

Example 1 

Isolation of DNA coding for a virulence gene in Streptococcus pneumoniae 

As mentioned above, each of the DNAs disclosed herein, by virtue of the fact that it 
includes an intact open reading frame, is useful to a greater or lesser extent as a screen for 
identifying antimicrobial compounds. A useful approach for selecting the preferred DNA 
sequences for screen development is evaluation by insertion-duplication mutagenesis. This 
system, disclosed by Morrison et al., J. Bacteriol. 159:870 (1984), is applied as follows. 

Briefly, random fragments of Streptococcus pneumoniae, strain 0100993 DNA are 
generated enzymatically (by restriction endonuclease digestion) or physically (by sonication 
based shearing) followed by gel fractionation and end repair employing T4 DNA 
polymerase. It is preferred that the DNA fragments so produced are in the range of 200-400 
base pairs, a size sufficient to ensure homologous recombination and to ensure a 
representative library in E. coli. The fragments are then inserted into appropriately tagged 
plasmids as described in Hensel et al., Science 269:400-403 (1995). Although a number of 
plasmids can be used for this purpose, a particularly useful plasmid is pJDC9 described by 
Pearce et al., Mol. Microbiol. 9:1037 (1993) which carries the erm gene facilitating 
erythromycin selection in either E. coli or S. pneumoniae, previously modified by 
incorporation of DNA sequence tags into one of the polylinker cloning sites. The tagged 
plasmids are introduced into the appropriate S. pneumoniae strain selected, inter alia, on the 
basis of serotype and virulence in a murine model of pneumococcal pneumonia. 

It is appreciated that a seventeen amino acid competence factor exists (Havastein et 
al., Proc. Natl. Acad. Sci. USA 92:11140-4 (1995)) and may be usefully employed in this 
protocol to increase the transformation frequencies. A proportion of transformants are 
analysed to verify homologous integration and as a check on stability. Unwanted levels of 
reversion are minimized because the duplicated regions will be short (200-400 bp); however, 
if significant reversion rates are encountered they may be modulated by maintaining 
antibiotic selection during the growth of the transformants in culture and/or during growth 
in the animal. 

The S. pneumoniae transformants are pooled for inoculation into mice, e.g., Swiss 
and/or C57BL/6. Preliminary experiments are conducted to establish the optimum 
complexity of the pools and level of inoculum. A particularly useful model has been 
described by Veber et al. (J. Antimicrob. Chemother. 32:473 (1993)) in which 10^5 cfu 
inocula sizes are introduced by mouth to the trachea. Strain differences are observed with 
respect to onset of disease, e.g., 3-4 days for Swiss mice and 8-10 days for C57BL/6. 
Infection yields in the lungs approach 10* cfu/lung. IP administration is also possible when 
genes mediating blood stream infection are evaluated. Following optimization of 
parameters of the infection model, the mutant bank, normally comprising several thousand 
strains, is subjected to the virulence test. Mutants with attenuated virulence are identified by 
hybridization analysis using the labelled tags from the "input" and "recovered" pools as 
probes as described in Hensel et al., Science 269:400-403 (1995). S. pneumoniae DNA is 
colony blotted or dot blotted; DNA flanking the integrated plasmid is cloned by plasmid 
rescue in E. coli (Morrison et al., J. Bacteriol. 159:870 (1984)) and sequenced. Following 
sequencing, the DNA is compared to the nucleotide sequences given herein and the 
appropriate ORF is identified and function confirmed, for example by knock-out studies. 
Expression vectors providing the selected protein are prepared and the protein is configured 
in an appropriate screen for the identification of anti-microbial agents. Alternatively, 
genomic DNA libraries are probed with restriction fragments flanking the integrated 
plasmid to isolate full-length cloned virulence genes whose function can be confirmed by 
"knock-out" studies or other methods, which are then expressed and incorporated into a 
screen as described above. 
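
The comparison of "input" and "recovered" pools described above can be sketched in Python as follows. This is a minimal illustration that assumes each tag has already been reduced to a numeric hybridization signal per pool; the function name, the signal units and both cut-off values are hypothetical and are not taken from the text or from Hensel et al.

```python
def attenuated_tags(input_signal, recovered_signal, min_input=1.0, max_ratio=0.1):
    """Return tags that are present in the input pool but depleted after infection.

    input_signal / recovered_signal map a tag name to its hybridization signal
    when probed with labelled tags from the respective pool. A tag missing from
    recovered_signal is treated as signal 0. Both thresholds are illustrative.
    """
    candidates = []
    for tag, in_sig in input_signal.items():
        if in_sig < min_input:
            # Tag was not reliably represented in the inoculum; skip it.
            continue
        ratio = recovered_signal.get(tag, 0.0) / in_sig
        if ratio <= max_ratio:
            candidates.append(tag)
    return sorted(candidates)


if __name__ == "__main__":
    input_pool = {"tag01": 10.0, "tag02": 8.0, "tag03": 0.2, "tag04": 5.0}
    recovered_pool = {"tag01": 9.0, "tag02": 0.4}
    print(attenuated_tags(input_pool, recovered_pool))
```

Mutants whose tags appear in this list are the candidates whose flanking DNA would then be cloned and sequenced.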

The individual full length sequences given herein are summarized in the following 
Tables (Table 1 and Table 2). Under the column in Table 1 labeled "Identity" there is the 
deduced identity of each open reading frame of the invention determined using Blastp 
and/or MPSearch. The ORF# column indicates whether the polynucleotide encoding each 
ORF encodes more than one ORF. For example, SEQ ID NO:224 has an ORF#1 but no 
#2; thus the polynucleotide encodes this ORF#1, but no other ORF was detected. On the 
other hand, SEQ ID NO:225 and 226 have ORF#s 1 and 2 respectively, indicating that they 
were both encoded by the same polynucleotide. This can also be seen in Table 2 where the 
polynucleotide of SEQ ID NO: 2 encodes the ORFs of SEQ ID NOS: 225 and 226. Table 1 
also shows the start ("START" column) and stop ("STOP" column) codons for each ORF 
and their positions in the encoding polynucleotide sequence. The SEQ ID NOS of the 
polypeptides of this table are linked to both a deduced identity in this table and a 
polynucleotide sequence in Table 2 which encodes each polypeptide. The "Direction" 
column in Table 1 shows the direction of the ORF encoding each polypeptide in this table. 
"Forward" denotes the sense orientation and "Reverse" denotes the antisense orientation of 
the ORF. 
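
How the position and direction columns of Table 1 relate to a polynucleotide sequence can be sketched in Python as below. The sketch assumes that positions are 1-based and inclusive on the listed strand and that a "Reverse" ORF is read from the reverse complement of that span; these conventions and the function names are assumptions for illustration, not statements from the specification. The assumption is at least consistent with the codon columns, since reverse entries such as "-CAT" and "TCA-" are the reverse complements of ATG and TGA.

```python
# Translation table for complementing DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_orf(polynucleotide, start, stop, direction):
    """Return the coding-strand sequence of an ORF from a Table 1 style row.

    start/stop are assumed to be 1-based inclusive positions with start <= stop;
    direction is "Forward" or "Reverse" as in the Direction column.
    """
    if not (1 <= start <= stop <= len(polynucleotide)):
        raise ValueError("positions out of range")
    segment = polynucleotide[start - 1:stop]
    if direction == "Forward":
        return segment
    if direction == "Reverse":
        return reverse_complement(segment)
    raise ValueError("direction must be 'Forward' or 'Reverse'")


if __name__ == "__main__":
    # Toy sequences, not sequences of the invention.
    print(extract_orf("CCATGAAATAGGG", 3, 11, "Forward"))
    print(extract_orf("CCCTATTTCATGG", 3, 11, "Reverse"))
```

Both toy calls recover the same coding sequence, beginning with ATG and ending with a stop codon, which is the behaviour expected if the "Reverse" rows describe ORFs on the complementary strand.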
TABLE 1 

SEQ ID NO | Identity | ORF# | Start codon | Stop codon | Start position | Stop position | Direction
224 | Phosphate Transport ATP-Binding Protein PSTB. - Escherichia | 1 | -CAT | TCA- | 2 | 553 | Reverse
225 | probable ATP binding protein - Bacillus subtilis | 1 | -CAT | TCA- | 38 | 781 | Reverse
226 | Nopaline Transport System Permease Protein Nocm. - Agrobacterium Tumefaciens | 2 | -CAG | TTA- | 781 | 1512 | Reverse
227 | Aspartate Aminotransferase A (EC 2.6.1.1) (Transaminase A) (ASPAT.) - Rhizobium Meliloti. | 1 | ATG | TAG | 480 | 671 | Forward
228 | ISL2 protein - Lactobacillus helveticus | 1 | ATG | TAA | 549 | 947 | Forward
229 | ISL2 protein - Lactobacillus helveticus | 2 | ATG | TAG | 889 | 1353 | Forward
230 | Unknown | 3 | ATG | TGA | 1421 | 1570 | Forward
231 | Unknown | 2 | -CAT | CTA- | 111 | 413 | Reverse
232 | Unknown | 1 | ATG | TGA | 133 | 78? | Forward
233 | Acyl Carrier Protein (ACP.) - Rhizobium Meliloti. | 1 | -CAT | CTA- | 673 | 906 | Reverse
234 | Unknown | 1 | -CAT | TTA- | 15 | 137 | Reverse
235 | Unknown | *> | -CAT | CTA- | 68) | 989 | Reverse
236 | Sulfate Transport System Permease Protein CYST (Fragment). - Synechocystis SP. (Strain PCC 6803). | 2 | -CAT | TTA- | 336 | 689 | Reverse
237 | probable transposase (insertion sequence IS861) - Streptococcus agalactiae (strain COH-I) | 2 | -CAT | TCA- | 149 | 454 | Reverse
238 | Unknown | 2 | -PAH | 1 1A- | JOI | 851 | Reverse
239 | "PTS SYSTEM | 1 | ATG | TAA | 49 | 477 | Forward
240 | "PTS SYSTEM | 2 | -CAT | TTA- | 151 | 789 | Reverse
241 | nitrogen fixation protein (nifS) homolog - Haemophilus influenzae (strain Rd KW20) | 1 | -CAT | TCA- | 1 | 207 | Reverse
242 | cellobiose phosphotransferase system celA - Bacillus stearothermophilus | 1 | -CAT | TCA- | 84 | 431 | Reverse
243 | surface protein PspA - Streptococcus pneumoniae | 1 | ATG | TAA | 22 | 321 | Forward
244 | Unknown | 2 | ATG | TAA | 272 | 469 | Forward
245 | ATP-dependent Clp proteinase (EC 3.4.21.-) chain clpL - Lactococcus lactis subsp. lactis plasmid pUCL22 | 1 | ATG | TGA | 83 | 760 | Forward
246 | Beta-Glucosidase (EC 3.2.1.21) (Gentiobiase) (Cellobiase) (Beta-D-Glucoside Glucohydrolase) (Amygdalase). DdWHJUa i3Ul/llJ12>. | 1 | -CAT | TCA- | 153 | 599 | Reverse
247 | Glycine Betaine/L-Proline Transport ATP-Binding Protein Prov. - Escherichia Coli. | 2 | ATG | TAA | 159 | 887 | Forward
248 | Unknown | 3 | -CAT | TTA- | 1102 | 12^4 | Reverse
249 | 30S Ribosomal Protein S11 (BS11). - Bacillus Subtilis. | 1 | ATG | TGA | 15 | 164 | Forward
250 | DNA-Directed RNA Polymerase Alpha Chain (EC 2.7.7.6) (Transcriptase Alpha Chain). - Bacillus Subtilis. | 2 | ATG | TAA | 282 | 1217 | Forward
251 | Peptide Chain Release Factor 3 (RF-3). - Bacteroides Nodosus (Dichelobacter Nodosus). | 1 | ATG | TAG | 212 | 667 | Forward
252 | Unknown | 1 | ATG | TAG | 267 | 353 | Forward
253 | Riboflavin Synthase Alpha Chain (EC 2.5.1.9). - Bacillus Subtilis. | 2 | -CAT | CTA- | 213 | 662 | Reverse
254 | Unknown | 3 | -CAT | TTA- | 833 | 1045 | Reverse
255 | Unknown | 1 | -CAT | TTA- | 83 | 424 | Reverse
256 | mesl protein - Leuconostoc mesenteroides | 2 | ATG | TAG | 448 | 1302 | Forward
257 | Transacetylase BMTD (EC 2.3.1.-). - Bacillus Subtilis. | 1 | CTG | TAG | 3 | 320 | Forward

SEQ 

258. 



Identity 

ribose-phosphate pyrophosphokinasc (EC 
2.7.6. 1.) - Bacillus caldolvticus 



ORF# 

1 



Start 

CTG 



Stop 
TAG 



Positi on 
Start 

1 



Stop 
642 



Direction 

Forward 



259. 



Unknown 



ATG 



TAA 



66 



260. 
261. 

262. 
263. 



Unknown 

nitrogenase C (nifC) homolog - 
Haemophilus influenzae (strain Rd KW20^ 
Unkown 
Unknown 



ATG 
ATG 

ATG 
ATG 



TAG 
TAG 

TAA 
TGA 



108 
631 

606 
280 



590 
855 

752 
495 



Forward 
Forward 



264. 



265. 

266. 
267. 



268. 



269. 



270. 



271. 



273. 



275. 
276. 



Unknown 



Acetyl Esterase (EC 3.1.-.-). - Caldocelium 
Saccharolyticum. 
Unknown 

Triosephosphasie Isomerase (EC 5.3.1.1 ) 
(TIM). - Bacillus Subtilis 



ATG 



TGA 



639 



Branched -chain Amino Acid 
Aminotransferase (EC 2.6. 1 .42) 
(Transaminase B). - ES Cherichia Coli. 



branched-chain-amino-acid transaminase 
homolog - Haemophilus influenzae (strain 
Rd KW20) 



DnaK protein - Lactococcus lactis 



Ketol-Acid Reductoisomerase (EC 1.1.1.86) 
(Acetohydroxy-Acid Isomeroreductase). 
Lactococcus Laciis (Subsp. Lactis) 
(Streptococcus Lactis) 
Unknown 



Amidophosphoribosyltransferase Precursor 
(EC 2.4.2.14) (Glutamine Phosphoribosy 
Lpyrophosphaste Amidotransferase) ( Atase) 
- Bacillus Subtilis 

Pyrrolidone-Carboxylate Peptidase (EC 
3.4.19.3) (5-Oxoprolyl-Peptidase). - STR 

Epiococcus Pyogenes. 

SOS Ribosomal Protein LI 6. - Mycoplasma 

Capricolum. 

serine O- acetyl transferase (EC 2.3. 1 .30) - 
Bacillus stearothermophilus 
Unknown 



ATG 

CAT 
-CAT 



TAA 

TCA- 
TTA- 



ATG 



TAG 



ATG 



TAA 



CTG 



ATG 



-CAT 



-CAT 



CAT 



ATG TAA 



TAA 



CTA- 



274 

2 
270 



110 



708 



3 

99 



CTA- 



TCA- 



278 



-CAT 



lATG 



CTA" 



TAG 



152 

156 

33 

577 

61 



1466 



Forward 



594 



Forward 



157 Reverse 
665 Reverse 



736 



Forward 



842 Forward 



749 



428 



631 



Forward 



775 

803 

416 

1194 

648 



Forward 



Reverse 



Reverse 

Reverse 

Forward 
Reverse 
Forward 

278 | Unknown | 1 | -CAT | ZTA- | 165 | 335 | Reverse
279 | Lipoprotein Signal Peptidase (EC 3.4.23.36) (Prolipoprotein Signal Peptidase) (Signal Peptidase II) (SPase II). - Staphylococcus Aureus. | t i | ATG | TAA | JO | 517 | Forward
280 | Unknown | 1 | ATG | TAA | | 534 | Forward
281 | Alpha-Acetolactate Decarboxylase (EC 4.1.1.5). - Bacillus Subtilis. | 1 | -CAT | CTA- | 104 | 445 | Reverse
282 | Dihydrodipicolinate Synthase (EC 4.2.1.52) (DHDPS). - Bacillus Subtilis. | 3 | -CAT | TCA- | 675 | 884 | Reverse
283 | Polyribonucleotide Nucleotidyltransferase (EC 2.7.7.8) (Polynucleotide Phosphorylase) (PNPase). - Photorhabdus Luminescens. | 1 | -CAT | TCA- | 1 | 855 | Reverse
284 | Single-Strand Binding Protein (SSB) (Helix-Destabilizing Protein). - Bacillus Subtilis | 1 | -CAT | TTA- | 128 | 598 | Reverse
285 | ATP-Dependent CLP Protease ATP-Binding Subunit CLPX. - Escherichia Coli. | 2 | -CAT | CTA- | 195 | 482 | Reverse
286 | ATP-Dependent CLP Protease ATP-Binding Subunit CLPX. - Escherichia Coli | 4 | -CAG | TCA- | 676 | 990 | Reverse
287 | N-(5'-Phosphoribosyl) Anthranilate Isomerase (EC 5.3.1.24) (PRAI). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis). | 1 | ATG | TAA | 83 | 712 | Forward
288 | 30S Ribosomal Protein S2. - Escherichia Coli. | 1 | ATG | TGA | 277 | 591 | Forward
289 | 30S Ribosomal Protein S2. - Escherichia Coli. | 3 | ATG | TGA | 918 | 1064 | Forward
290 | Possible beta-galactosidase precursor | 1 | -CAT | CTA- | 120 | 545 | Reverse
291 | Unknown | 1 | -CAT | TTA- | 77 | 283 | Reverse
292 | Unknown | 1 | -CAT | TTA- | 1 | 198 | Reverse
293 | "Phospho-2-Dehydro-3-Deoxyheptonate Aldolase | 1 | -CAT | TTA- | 217 | 690 | Reverse
294 | Unknown | 2 | -CAT | TTA- | 429 | 599 | Reverse
295 | Unknown | 3 | -CAT | TCA- | 739 | 936 | Reverse
296 | Sorbitol Dehydrogenase (EC 1.1.1.14) (L-Iditol 2-Dehydrogenase). - Bacillus Subtilis. | 1 | ATG | TAA | 64 | 480 | Forward
297 | EBG Operon Repressor. - Escherichia Coli. | 2 | -CAT | CTA- | 668 | 1060 | Reverse
298 | cellobiose phosphotransferase system celA - Bacillus stearothermophilus | 2 | -CAT | TTA- | 249 | 566 | Reverse
299 | Unknown | 3 | -CAT | TCA- | 581 | 964 | Reverse
300 | Adenylosuccinate Lyase (EC 4.3.2.2) (Adenylosuccinase) (ASL). - Bacillus Subtilis. | 1 | -CAT | TTA- | 99 | 809 | Reverse
301 | ATP-Binding Protein BEXA. - Haemophilus Influenzae. | 1 | ATG | TGA | 111 | 404 | Forward
302 | L-Lactate Dehydrogenase (Cytochrome) (EC 1.1.2.3). - Escherichia Coli. | 2 | ATG | TAA | 337 | 507 | Forward
303 | Phosphate Transport System Permease Protein PSTC. - Escherichia Coli | 3 | ATG | TGA | 507 | 1070 | Forward
304 | Sulfate Transport ATP-Binding Protein Cysa. - Synechococcus SP. (Strain PCC 7942) (Anacystis Nidulans R2). | 1 | -CAT | CTA- | 110 | 838 | Reverse
305 | Unknown | 2 | -CAT | TTA- | 838 | 1077 | Reverse
306 | Unknown | 2 | -CAT | TCA- | 282 | 749 | Reverse
307 | Unknown | 2 | ATG | TAG | 1108 | 1290 | Forward
308 | Phosphoribosylformylglycinamidine Cyclo-Ligase (EC 6.3.3.1) (AIRS) (Phosphoribosyl-Aminoimidazole Synthetase) (AIR Synthase). - Bacillus Subtilis. | 2 | -CAT | TCA- | 331 | 477 | Reverse
309 | Unknown | 2 | ATG | TAA | 313 | 765 | Forward
310 | SMS Protein. - Escherichia Coli | 1 | - W\ 1 | | 102 | 416 | Reverse
311 | sigma 42 protein - Enterococcus faecalis | 1 | -CAT | TCA- | 8 | 487 | Reverse
312 | Unknown | 1 | -CAT | TTA- | 76 | 390 | Reverse
313 | Guanylate Kinase (EC 2.7.4.8) (GMP Kinase). - Escherichia Coli. | 2 | -CAG | TTA- | 415 | 849 | Reverse
314 | Unknown | 1 | -CAT | TCA- | 51 | 296 | Reverse
315 | Unknown | 1 | ATG | TAA | 175 | bs* | Forward
316 | Unknown | 2 | ATG | TAA | 361 | | Forward
317 | Unknown | 3 | ATG | TAA | 383 | 1044 | Forward
318 | PTS System | 2 | -CAT | TCA- | 166 | 465 | Reverse
319 | L-Fucose Isomerase (EC 5.3.1.-). - Escherichia Coli. | 1 | -CAT | CTA- | 9 | 482 | Reverse
320 | Unknown | 2 | -CAT | TTA- | 495 | 650 | Reverse
321 | Unknown | 1 | ATG | TGA | 130 | 231 | Forward
322 | 3-Oxoacyl-[Acyl-Carrier Protein] Reductase (EC 1.1.1.100) (3-Ketoacyl-Acyl Carrier Protein Reductase). - Escherichia Coli. | 1 | -CAT | TCA- | 51 | 200 | Reverse
323 | 3-Oxoacyl-[Acyl-Carrier Protein] Reductase (EC 1.1.1.100) (3-Ketoacyl-Acyl Carrier Protein Reductase). - Escherichia Coli. | 2 | -CAT | TCA- | 231 | 6J4 | Reverse
324 | 2-Isopropylmalate Synthase (EC 4.1.3.12) (Alpha-Isopropylmalate Synthase) (Alpha-IPM Synthetase). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis). | 1 | -CAT | TTA- | 31 | 231 | Reverse
325 | 2-Isopropylmalate Synthase (EC 4.1.3.12) (Alpha-Isopropylmalate Synthase) (Alpha-IPM Synthetase). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis). | 2 | -CAT | TCA- | 191 | 334 | Reverse
326 | Unknown | i j | tat | | | 41)2 | Reverse
327 | 2-Isopropylmalate Synthase (EC 4.1.3.12) (Alpha-Isopropylmalate Synthase) (Alpha-IPM Synthetase). - Lactococcus Lactis (Subsp. Lactis) (Streptococcus Lactis). | 4 | -CAT | TTA- | 495 | 1127 | Reverse
328 | Unknown | 2 | -CAT | TCA- | 170 | 469 | Reverse
329 | Unknown | 1 | ATG | TAG | 105 | 473 | Forward
330 | Strscaa Ncbi gi: 310629 NCBI gi: 473 - Streptococcus gordonii (strain PK488) DNA | 1 | -CAG | CTA- | 81 | 665 | Reverse

SEQ ID NO: Identity 



ICodon 



ORF#)Start 



Stop 



Position 
Start 



Stop 



Direction 



33). 
332. 



Unknown 



ATG 



TGA 



228 



374 Forward 



lysyl aminopcptidase (EC 3.4.1 1.15) 
precursor - Lactococcus lactis 



-CAT 



TCA- 



107 



766 Reverse 



333. 



Indoie-3-GlyceroI Phosphate Synthase (EC 
4.1.1.48) (IGPS). - Lactococcus Lactis 
(SUBSP. Lactis) (Streptococcus Lactis) 



-CAT 



TCA- 



127 



369 



Reverse 



334. 



Anthranilate Phosphoribosyltransferase (EC 
2.4.2.I8).-Lactococcus Lactis (Sub SP. 
Lactis) (Streptococcus Lactis). 



-CAA 



TCA- 



366 



641 



Reverse 



335. 



Tagatose-6-Phosphate Kinase (EC 2.7.1.-) 
(Phosphotagatokinase). - Lactococcus L 
Actis (Subsp. Lactis) (Streptococcus Lactis) 



ATG 



TGA 



42 



524 



Forward 



336. 



337. 



Unknown 



ATG 



TGA 



Unknown 



-CAT 



TTA- 



73 



32 



474 



Forward 



338. 



339. 



Unknown 



Acetolactaie Synthase Large Subunit (EC 
4. 1.3. 1 8) ( AHAS) ( Acetphydroxy- Acid Syn 
thase Large Subunit) ( ALS). - Lactococcus 
Lactis (SUBSP. Lactis) (Streptococc us 
Lactis). 



ATG 



TGA 



ATG 



TAG 



306 



59 



572 



502 



Forward 



Forward 



340. 



Penicillin-Binding Proteins I A/IB. - 
Bacillus Subtilis. 



ATG 



TGA 



535 



720 



Forward 



341. 



Unknown 



ATG TAG 



165 



488 
394 



342. 



Anthranilate Phosphoribosyltransferase (EC 
2.4.2. 18). - Lactococcus Lactis (SUB SP. 
Lactis) (Streptococcus Lactis). 



-CAT TCA- 



Forward 
Reverse 



343. 



344. 



grpE protein - Lactococcus Lactis 



Unknown 



ATG 



-CAT 



TAA 



124 



CTA- 



543 



Forward 



345. 
346. 
347. 
348. 
349. 
350. 
351. 



Unknown 
Unknown 
Unknown 
Unknown 
Unknown 
Unknown 
NIFS Protein Homolog (Fragment). 
Lactobacillus Delbrueckii (SUBSP. 
Bulgaricus). 

Ornithine Carbamoyl transferase 



-CAT 
ATG 
-CAT 
-CAT 
CAT 
CAT 
ATG 



TTG 



TTA- 

TAA 

CTA- 

CTA- 

CTA- 

TTA- 

TAG 



TAA 



377 
159 
315 
468 
482 
380 
87 



195 
544 
503 
644 
587 
631 
757 
797 



Reverse 
Reverse 
Forward 
Reverse 
Reverse 
Reverse 
Reverse 
Forward 



502 Forward






WO 97/37026 



PCT/US97/05306 



353. 1 


PSEGLI NCBI gi: 499660 - Pseudomonase 

mif iHn 

JUUUd. 


1 


-CAT


TTA- 


50 


481 


Reverse 






1 


CTG 


TAA 


1 


522 


Forward 


355. 


Thioredoxin. - Streptomyces Clavuligerus.


1 


ATG 


TAG 


58 


375 


Forward 




D-Alanine permease (dagA) homolog -
Haemophilus influenzae (strain Rd KW20)


j 


ATG 


TAG 


185 


343 


Forward 


357.


Unknown


1


-CAT 


TTA- 


78 


371 


Reverse 


358. 


"DNA Polymerase III 


1 


-CAA 


TCA- 


82 


561 


Reverse 


359.


Transport ATP-Binding Protein COMA. -
Streptococcus Pneumoniae.


1


-CAT 


TTA- 


112 


552 


Reverse 


360.


Unknown 


2 


ATG 


TAA 


445 


540 


Forward 


361. 


surface protein PspA - Streptococcus 
pneumoniae 


1


-CAT 


TCA- 


45 


416 


Reverse 


362. 


LICD Protein. - Haemophilus Influenzae


2 


ATG 


TAA 


268 


633 


Forward 




Unknown


2 


ATG 


TAA 


527 


640 


Forward 


364. 


Glutamine Transport ATP- Binding Protein 
GLNQ. - Escherichia Coli. 


1 


CTG 


TGA 


1 


393 


Forward 


365. 


Unknown 


1 


ATG 


TAG 


184 


303 


Forward 


366. 


Unknown 


1 


ATG 


TGA 


794 


919 


Forward 


367. 


Lipoamide Dehydrogenase Component (E3) 
of Pyruvate Dehydrogenase Complex (EC 1. 
8.1.4) (Dihydrolipoamide Dehydrogenase). - 
Azotobacter Vinelandii. 


1 


ATG 


TAA 


3 


416 


Forward 
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Codon 



Position 



SEQ ID NO: 



Identity



ORF#



Start 



Stop 



Start 



Stop 



Direction 



368. 



Orotate Phosphoribosyltransferase (EC 
2.4.2.10) (OPRT). - Bacillus Subtilis.



-CAT 



CTA- 



21 



311 



Reverse 



369. 
370. 



Unknown 



ATG 



TAA 



28 



309 



Forward 



SPO0B-Associated GTP-Binding Protein. -
Bacillus Subtilis. 



ATG 



TGA 



110 



538 



Forward 



371. 



Transport ATP-Binding Protein COMA. - 
Streptococcus Pneumoniae. 



-CAA 



TCA- 



47 



679 



Reverse 



372. 
373. 



prephenate dehydrogenase (EC 1.3.1.12)- 
lactococcus lactis. 



ATG 



TAG 



271 



492 Forward 



O-Sialoglycoprotein Endopeptidase (EC
3.4.24.57) (Glycoprotease). - Pasteurella
Haemolytica.



CTG 



TAA 



450 



Forward 



374. 



375. 



376. 



Unknown 



-CAT



TTA- 



Unknown 



TTG 



TAA 



'DNA Polymerase III



ATG 



TGA 



282 



Reverse 



265 



Forward 



Forward 
Forward 



377. 



Unknown 



ATG 



TAA 



361 



378. 



Dihydrodipicolinate Synthase (EC 4.2.1.52) 
(DHDPS). - Bacillus Subtilis. 



-CAT 



TTA- 



179 



457 



Reverse 



379. 



Unknown 



CAT 



TCA- 



104 



380. 



381. 



382. 



Unknown 



-CAT 



TCA- 



Unknown 



248 



CAT 



Unknown 



TTA- 



56 



-CAT 



TTA- 



649 



Reverse 



394 



Reverse 



409 
617 



383. 



384. 



Unknown 



ATG 



Unknown 



TAG 



138 



CAT 



TTA- 



225 



Reverse 
Forward 



479 Reverse 
857 Forward



385. 



Unknown 



ATG 



TAA 



386. 



Unknown 



ATG 



TAG 



387. 



Unknown 



ATG 



TGA 



443 Forward
544 Forward 



388. 



Unknown 



-CAT 



CTA- 



389. 



Unknown 



CTG 



390. 



Unknown 



-CAT 



TTA- 



309 

3 



391. 



Phosphopentomutase (EC 5.4.2.7). -
Escherichia Coli. 



ATG 



TAA 



452 
233 



Reverse 
Forward 



D-Alanyl-D-Alanine Carboxypeptidase
Precursor (EC 3.4.16.4) (DD-Peptidase) (DD-
Carboxypeptidase) (CPase) (PBP5). -
Bacillus Subtilis

Na+ and Cl-dependent gamma-aminobutyric
acid transporter homolog - Haemophilus



-CAT 



TTG 



TTA-



TGA 



52 



537 



268 



Reverse 



Forward 








influenzae (strain Rd KW20) 














394. 


Unknown 


2 


ATG 


TGA 


319 


546 


Forward 


395. 


dihydrolipoamide dehydrogenase (EC
1.8.1.4) - Pelobacter carbinolicus


1 


ATG 


TGA 


3 


284 


Forward 


396. 


Unknown 


2 


ATG 


TGA 


241 


450 


Forward 


397. 


Beta-Glucosidase A (EC
3.2.1.21) (Gentiobiase) (Cellobiase) (Beta-D-
Glucoside Glucohydrolase). - Clostridium
Thermocellum.


1 


ATG 


TAA 


184 


453 


Forward 


398. 


Uracil Permease. - Bacillus Caldolyticus.


1


ATG 


TAA 


93 


353 


Forward 


399. 


Unknown 


2 


ATG 


TAG 


127 


516 


Forward 


400. 


oligoendopeptidase F - Lactococcus lactis


1


ATG 


TGA 


134 


310 


Forward 


401. 


Stratpasea NCBI gi: 153565 NCBI gi: 4 -
Streptococcus Faecalis DNA.


2 


ATG 


TAA 


392 


568 


Forward 


402. 


Unknown 


2 


ATG 


TAA 


376 


507 


Forward 


403. 


Exodeoxyribonuclease Small Subunit (EC 
3.1.11.6) (Exonuclease VII Small Subunit). -
Escherichia Coli. 


2 


-CAT 


TCA- 


470 


682 


Reverse 


404. 


ATP-Dependent DNA Helicase RECG (EC
3.6.1.-). - Escherichia Coli.


1


ATG 


TGA 


3 


455 


Forward 






SEQ ID NO: 

405. 



Identity 

Possible thiamin biosynthetic enzyme



ORF#

1



Codon
Start

-CAT



Stop 
TTA- 



Position 
Start 

15 



Stop 

347 



Direction 
Reverse 



406. 



SPOU Protein. - Escherichia Coli. 



ATG 



TAA 



322 



618 



407. 



Malonyl CoA-Acyl Carrier Protein
Transacylase (EC 2.3.1.39). - Escherichia 
Coli. 



ATG 



TAG 



85 



498 



Forward 



408. 



409. 



410. 



Unknown 



-CAT 



nucleoside diphosphate kinase (ndk) 
homolog- Haemophilus influenzae (strain 
Rd KW20) 



CTA- 



17 



CTG 



TGA 



Nucleoside Diphosphate Kinase (EC 
2.7.4.6)(NDK) (NDP Kinase). - Escherichia 
Coli.



ATG 



TAG 



215 



118 



Reverse 



159 



Forward 



481 



Forward 



411. 



412. 



Unknown 



Unknown 



-CAT



TTA-



21



-CAT



TCA- 



162 



368 



Reverse 



413. 



414. 



Unknown 



Unknown 



ATG TAA 



ATG 



187 



TGA 



316 



417 Forward
417 Forward
453 Forward 



415. 



416. 



417. 



Unknown 



Enolase (EC 4.2.1.11) (2-Phosphoglycerate
Dehydratase) (2-Phospho-D-Glycerate
Hydro-Lyase). - Bacillus Subtilis.



Unknown 



ATG 



TGA 



316 



-CAT 



TTA- 



CTG 



TGA 



435 



Reverse 



418. 



Unknown 



ATG 



TGA 



419. 



420. 



PILB Protein. - Neisseria Gonorrhoeae. 



-CAT 



Unknown 



TTA- 



145 



ATG TGA 285 



327 



Reverse 



533 
379 



421. 



422. 



Unknown 



CTG 



integrase/recombinase (xprB) homolog -
Haemophilus influenzae (strain Rd KW20) 



ATG 




TAG 



TAA 



305 



41 



Forward 



Forward 
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Subnlis. 1 












All 


glycosyl transferase (lgtD) homolog -
Haemophilus influenzae (strain Rd KW20) 


i 
l 


ATG 


TT5A 




J/7 


Forward


434. 


Unknown 


2 


-CAT 


CTA- 


244 


435 


Reverse 


435. 


Unknown 


2 


ATG 


TGA 


118 


309 


Forward 


436. 


Factor Essential for Expression of
Methicillin Resistance. - Staphylococcus
Aureus.


2 


-CAT 


TTA- 


168 


434 


Reverse 


437. 


Unknown 


1 


-CAT 


TCA- 


2 


214 


Reverse 


438. 


Unknown 


1 


TTG 


TAA 


2 


142 


Forward 


439. 


Unknown 


1 


ATG 


TAA 


30 


191 


Forward 
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SEQ ID NO 

440. 



Identity 



ORF# 



Codon 
Start 



Stop 



Position 
Start 



Stop 



Direction 



Aspartate Aminotransferase (EC 2.6.1.1)
(Transaminase A) (ASPAT). - Bacillus SP.
(Strain YM-2).



-CAT 



TCA- 



63 



416 



Reverse 



441. 
442. 



Unknown 



ATG 



TAA 



52 



342 



Forward 



443. 
444. 



Unknown 



-CAT 



Unknown 



TCA- 



210 



TTG 



TGA 



455 



Reverse 



517 



Forward 



ATP-Dependent DNA Helicase RECG
(EC 3.6.1.-) - Escherichia Coli.



ATG 



TGA 



83 



376 



Forward 



445. 



Unknown 



ATG 



TAA 



70 



384 



446. 



Cell Division Protein FTSA. - Bacillus 
Subtilis. 



TTG 



TAA 



371 



Forward 



447. 



Unknown 



ATG 



TAG 



70 



448. 



449. 



450. 



451. 



Unknown 



Unknown 



Unknown 



Unknown 



ATG 



TAG 



104 



CTG 



TAA 



ATG 



ATG 



TAA 



TGA 



120 



31 



454 



159 
347 



Forward 



Forward 



Forward 



452. 



Unknown 



ATG 



TGA 



225 



Forward 
Forward 



453. 



Unknown 



ATG 



TGA 



290 



454. 



Unknown 



ATG 



TGA 



Forward 
Forward 



455. 



JAG Protein (SPOIIIJ Associated Protein). -
Bacillus Subtilis.



ATG 



TGA 



93 



365 



456. 



457. 



'DNA-3-Methyladenine Glycosidase I (EC
3.2.2.20) (3-Methyladenine-DNA Glycosylase
I



ATG 



TAG 



91 



"Glucan 1 



-CAT TTA 



282 



Forward 



458. 



459. 



Unknown 



CAT 



Glutamate/Aspartate Transport ATP- 
Binding Protein GLTL. - Escherichia Coli 



CAT 



CTA- 



245 



TCA- 



8) 



150 
400 



Reverse 
Reverse 



218 



Reverse 



460. 
461. 
462. 
463. 
464. 
465. 
466. 
467. 



Unknown 
Unknown 



Unknown 
Unknown 
Unknown 
Unknown 
Unknown 

Llcpyrda NCBI gi: 511014 - Lactococcus
lactis.



-CAT 
ATG 



TTA- 
TGA 



103 
305 



CAT 
-CAT 
-CAT 
-CAT 
ATG 
ATG 



TCA- 

TTA 

TCA- 

TTA- 

TGA 

TGA 



29 

572 

652 

318 

719 

134 



492 
484 



Reverse 
Forward 



355 

838 

1026 

764 

805 

472 



Reverse 
Reverse 
Reverse 
Reverse 
Forward 
Forward 
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468. 


Unknown 


1 


ATG 


TGA 


385 


492 


Forward 


469. 


Unknown 


2 


ATG 


TAA 


587 


721 


Forward 


470. 


galE protein - Neisseria meningitidis 


1 


-CAT 


TCA- 


23 


460 


Reverse 


471. 


Unknown 


2 


-CAG 


TTA- 


717 


1319 


Reverse 


472. 


Naphthoate Synthase (EC 4.1.3.36)
(Dihydroxynaphthoic Acid
Synthetase) (DHNA Synthetase). -
Escherichia Coli.


1 


ATG 


TGA 


97 


414 


Forward 



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SEQ ID NO: 

473. 
474. 


Identity 

Unknown 
Unknown 


ORF# 
1 


Codon
Start 

CTG 


Stop 

TGA 


Position 
Start

1 


Stop 

246 


Direction 
Forward 


475. 
476. 
477. 


Unknown 
Unknown 

Multiple Sugar-Binding Transport ATP- 
Binding Protein MSMK. - Streptococcus 
Mutans.


1 
1 
1 

2 


-CAT 
-CAT 
-CAT 
ATG 


TCA- 
TTA- 
TTA- 
TAA 


341 
217 
499 
407 


748 
858 
729 
571 


Reverse 
Reverse 
Reverse 
Forward

Table 2 correlates the SEQ ID NO of each DNA sequence of the invention with the
SEQ ID NO(s) of the polypeptide or polypeptides that its open reading frame(s)
encode. For example, the DNA of SEQ ID NO:1 encodes one polypeptide, that of
SEQ ID NO:224, whereas the DNA of SEQ ID NO:2 encodes two polypeptides, those of
SEQ ID NO:225 and SEQ ID NO:226.
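
The DNA-to-polypeptide correspondence just described can be pictured as a simple lookup. The following is a minimal sketch in Python that uses only the two examples stated in the text; the names DNA_TO_POLYPEPTIDES, polypeptides_for and dna_for are hypothetical and introduced here purely for illustration.

```python
# Hypothetical lookup built only from the two examples stated in the text:
# DNA SEQ ID NO -> SEQ ID NOs of the polypeptides its ORFs encode.
DNA_TO_POLYPEPTIDES = {
    1: [224],
    2: [225, 226],
}


def polypeptides_for(dna_seq_id):
    """Return the polypeptide SEQ ID NOs encoded by a DNA SEQ ID NO."""
    return list(DNA_TO_POLYPEPTIDES.get(dna_seq_id, []))


def dna_for(polypeptide_seq_id):
    """Return the DNA SEQ ID NO whose ORF encodes the polypeptide, or None."""
    for dna_id, polypeptide_ids in DNA_TO_POLYPEPTIDES.items():
        if polypeptide_seq_id in polypeptide_ids:
            return dna_id
    return None


print(polypeptides_for(2))
print(dna_for(226))
```

A one-to-many dictionary is used because a single DNA sequence may carry more than one open reading frame, as SEQ ID NO:2 does.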



TABLE 2

DNA (SEQ ID NO)    Protein (open reading frame) (SEQ ID NO)

 1    224
 2    225, 226
 3    227
 4    228, 229, 230
 5    231
 6    232
 7    233
 8    234
 9    235
10    236
11    237
12    238
13    239
14    240
15    241
16    242
17    243, 244
18    245
19    246
20    247, 248
21    249, 250
22    251
23    252
24    253, 254
25    255, 256
26    257
27    258
28    259
29    260, 261
30    262
31    263
32    264
33    265
34    266
35    267
36    268, 269
37    270
38    271
39    272
40    273
41    274
42    275
43    276
44    277
45    278
46    279
63    301
64    302, 303
65    304, 305
66    306
67    307
68    308
69    309
70    310
71    311
72    312, 313
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 610-270-4478 

(B) TELEFAX: 610-270-5090 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 683 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CTCAAACATG GTCTCTTCTA GTTGCATGGT CGCAATCGGA TCCAAGGCTG AGGCTGGGCT 60
CATCCATTAA GAGGATATCT GGCTTAACAG AGATGGCACG AGCGATACAG AGACGTTGTT 120
GCTGACCACC TGATAAGGTC AAGGCTGACT TGTGGAGATC GTCTTTAACC TGATCCCAAA 180
GGGCAGCCTG ACTAAGGGAG GTTTCTACGA TTTCATCTAG GACTTGCTTA TCCTTAACTC 240
CAGCACGTTC ATGCGCAAAG GTAATATTAC GGTAAATTGA CTTAGCAAAT GGATTCGGGC 300
GTTGAAAAAC CATTCCAATG TGTTTACGCA TTTCATAAAC GTTGATTTCT GGACGGTTGA 360
CATCAATTCC ACGATAGAGA ATCTGCCCAG TTACTTTAGC AATATCAATG GTATCATTCA 420
TGCGATTGAG ACTGCGTAAG TAGGTAGATT TCCCCGATCC CGACGGACCA ATCAAAGCTG 480
TAATTTTATT TCCTTTCAAA TTGCATATCA ATCCCCTTAA TGGATTCATT TTTACCATAG 540
TAAACATGGA CATCCTTAGT AGAAAGGGCT ACTTTTTCTT CAGGAAAGGT AAGGATATGC 600
TTCTCATCCC AGTTATATGT TGACATGGCT TCTCCTTTAG GCAGCGGTTA ATTTCTTGTG 660
TAGATAGCTT CCGAACTTAC GAG 683

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1512 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
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pm a prpnr IIP 
w 1 Au 1 L.L AAL, 




ttppttt atp 

X 1 ww 1 X i V- 


T ATT AT ATP A 

X f \ X X J. *» x W *A 


AATATAAGTC 


CGTTTG T AAC 


60 




1 1U1 1 1 A O 1 w 


PGPTPTTPTT 

WVJ W A V— X X w X X 


TAGGGGTGTG 


GATAATCTCA 


TCCGGAGTTC 


120 




p a TTTTPPPr 
unl 1 1 i\^www 


TTATPTAAGA 


AGAGAATTTT 


ATCCGCAACT 


TGGGCTACAA 


180 




ATP. iptt: arr 


AAAATPATGG 


TCTGACCTGA 


CTTAGCAGCA 


TC TGC AAT AG 


240 


Aw 1 1 1 i v_ 1 AL. 


ttpappp.app 


AATTPTGGGT 

i"UAl 1L1 vjvjVJX 


C AAGGGC TG A 


AGTTGGTTCG 


TPT AAG AGP A 




A A TV. P"" J\ rppTPP 


TTTPATAGP A 


agphpapgpg 


CTAGGGCAAC 


CCGTTGCTTP 


TGTPPAPPTG 


360 


AT AAA IajLjwVj 


Ti. r*p TLTh'h tpp 

Avjo A 1 AA lvjvj 


ttttp a ppht 

111 ivnLuu 1 


rorjAAAGcrp 


AAPP T T AGP P 


A APTPTTPPT 




IvjbLAAI LI 1 


Av3 1 1 l\. 1 


J. Vjvj 1 V_./AVJ/A 1 JA 


ATTTPTTGAP 


AAP AACP AAG 


PPTTPTTTPA 


480 


/-i t\ mrp A fT*t~* Ti Ti p 
wA X 1 A 1 LAfty 


tpptyittpp^ 


PnTTP A A A T A 


AATTAAACTG 


TTGGAAAACC 


ATAGAPAAPT 


540 


TACGCiLb X Avj 


r^r^r^ti a p"V"? A IT 
ouLnnubn J. 1 


TPTTfTTf; AG 


T^ ATTTTAGA 


AAAATPAAPT 


GAAAAAPPAT 


600 


CAATCTGAA 1 


ALjALjwL. AL, 1\j 


tp tapptpttt 

1 LAuu 1 vj 111 


P A Af2 ATA ATT 


GAnAPTT^PGA 


AG A AAGGTTG 


D O V 


ATTTTC C AG L 


J.A_L- 1 LjAALjAA 


LUnA 1 wAAVjLa 


PTAPA APTTP 


LV^V^l ill lOn 


ATATPP A AGT 


770 


TCAGATGATC 


C AAGAC AG 1 V- 


I Vj 1 V_L. I\jAAA 


AvjAjA 1 1 IajL 1 


rn a A AfVTVT* A A 
1 AAA 1 lUuAA 


ZITPTT1 ATT" A 
nlLl lAAlwA 


( Ov 


TTAACGAAGG 


TCTCCTTTCA 


CATCTvj»™1\L La 


A /^•TV^Ti A TV A 

L. AV_ 1\J7 1 A It- A 


rympp a r^* A A A 


lAbLLAl 111 


O u 


TCTCTCGATG 


AAACGACCGA 


GGCTTTv- AA 1 




At- 1 A\- LUAA 1 


AAA w AALjLsoL. 


qnn 


AACAGAGATG 


AAGCGTTCAA 


AATAGCGATA 


A IPPRPPTPP A 


1 AbAA 1 L. I 


LiAo-w 1 1 LiwVj w 


QfiPt 


AAAGACTTCC 


ACAACACCCG 


C ACTAAAAG C 


*Tt A A Z™' A rTV'*IIM 1 1 

TAGAOATGTM 


CC C TTvjvjTQ, A 


AALLoA l\3AVj 




GGAATTAATC 


AAGGTTGGAG 


T AGCTAC C Aw 


CLrL. 1 bv-A i 1 A 


buAA 1 AA 1 \- A 


L. l\.b 1 LbAlA 


1 flftfi 

1UOU 


AACTTGCGCT 


CGGGTCATAL7 


CLAbAL 1 LA-Vj 




A TP TP A PP A P 
AIL 1 L. AL,\_ AL? 


P ATT A APTP^ A 


XX*M\J 


GAGAATGGCT 


GCACGAATGG 


TTTCACTAGC 


ATAAGCTGCC 


TCATTAAAGG 


CAAAAGCGAC 


1200 


AATCGCAAAA 


GCTGCAGCTG 


GAATCGCATT 


GATATTGAGA 


CCAGTTCCCC 


ATTGCTGATT 


1260 


GAGGGCTTTC 


AAAGCCAAAG 


GGATTCCGTA 


GTAGGTCAAC 


ATGAGTTGCA 


CCAAAATCGG 


1320 


TGTCCCTTTT 


AAGAAACTAA 


CAAAGAAGGC 


CTGCAAGGGA 


TATAAAATCT 


TGACACGATT 


1380 


GATCTTCACA 


ATGGCAAAAA 


GAAGCGCCAA 


AACCAAGCCA 


AAAAGGGCAC 


CGCCAATTGT 


1440 


CAACATAATT 


GTTGTTGGAA 


GTTGTTGGAC 


AATTCTAGGG 


ATTCCATCAA 


AGACCGAACG 


1500 


TAGGCTAAAC 


AG 










1512 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CTGGAGATTA TTGAGATTGC CCGTCAAAAC GATTTGATTA TNTTTGCGGA TGAAATCTAT 60 

GACCGCATGG TAAATGGACG GACATGTGCA TAACGCCTGT GGCGAGCTTG GCACCAGATG 120 

TCTTCTGTGT CAGCATGAAT GGTCTGTCAA AATCCCACCG CATAGCAGGT TTCCCGTGTG 180 

GGATGGATGG TCTTGTCTGG CCCTAAGACT CATGTTAAGG GCTATATCGA AGGGCTCAAT 240

ATGCTGTCCA ATATGCGCCT TTGCTCTAAC GTTTTGGCTA CCAACGCTGC ATTAGGAATA 300

ATCACTGTCG TACAAACTTG CGCTTGGGGG GTCACCAATC AGTCGATGAA TTGCTTCTTC 360

CTGGTGGACG AATCTACGAG CAAAGAAATT TCATCTATAA TGCCATTCAA GATATTCCAG 420

GTTTGTCTGC CGTTAAACCC AAGGCGGGGC TCTATATCTT CCCAAAAATC GACCGCAATA 480

TGTACCGTAT CGATGATGAT GAGCAGTTTG TCCTTGATTT CTTGAAGCAG GAAAAGGTTC 540

TCTTGGTTCA TGGTCGAGGC TTTAACTGGC AGGAACCAGA CCACTTCCGT ATCGTTTACC 600

TTCCTCGTGT TGATGAGTTA GCCCAAATCC AAGAAAAGAT GACTCGTTTC TTGAAACAGT 660

ATCGTAGATA GGGCTTGCAT TCGAAAAAGC TGGAAACATT TGCCTAGAG 709

(2) INFORMATION FOR SEQ ID NO: 4:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1588 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CTGCTTCAGA ACCTGCCAAA CCAGCACCGA TAACATTGAT ATAAGATTGA GACACGACAC 60
TAATACCTCT TTGGGAGTGT GAAGTTAAGA TTCACATTGA AAAAGCCAAT CAGACTTACA 120
AGCTTTCGAG TTTCTTGGCT CAGGCTGAAA AAGTCCACAG GGCTTTTNCA CTCCCACAAA 180
TCTTTCTATT TTTTCTNCTA CTAGTATAAC AAAAAAAGGG AAGAAGGNAA ACTTCCCTGT 240
TTAGTCATTT TCTTGATGTA AAGAGATAGT GAGTATTCCA GTTAAGAATC AATTATATGC 300
TACTCTATAA AATCTTTTCC ACATAACGGA TCGATAGGGA CTGTTATTCT ATCTCTTGCT 360
ATAACCGTAT TATCTAAAAC AGCATAACAT TCAACATAGT GATCTCCTTT AAACTGTGAA 420
TCTTCCGTGA TATTTTATTT TACCTGAAAA AATAGCACAC GCTCACAATT CTTCCTAATA 480
GCCTCAGCTT CAATATTTCT TACTTTCCAA TAGACTCCCT GCGAAACAAA ATATGGTATA 540
GTAGTTCTAT GAATGATGAA GCAAGTAAAC AACTAACTGA TGCACGATTT AAGCGTCTTG 600
TTGGTGTTCA GCGCACGACT TTTGAAGAGA TATTAGCTGT ATTAAAAACA GCTTATCAAC 660
TTAAACACGC AAAAGGTGGA CGAAAACCTA AATTAAGCCT AGAAGACCTT CTTATGGCCA 720
CTCTTCAATA TGTGCGAGAA TATCGAACTT ATGAACAAAT TGCGGCTGAT TTTGGTATCC 780
ACGAAAGCAA CTTAATCCGT CGGAGCCAAT GGGTTGAAGT AACTCTTGTT CAAAGTGGTG 840
TTACGATTTC AAGAACTCCT CTCAGTTCTG AGGACACGGT AATGATTGAT GCGACGGAAG 900
TACAAATCAA TCGCCCTAAA AAAAGAATTA GCGAATCATT CTGGTAAAAA GAAATTTCAC 960
GCTATGAAGG CTCAAGCGAT TGTCACAAGT CAAGGGAGAA TTGTTTCTTT GGATATCGCT 1020
GTGAACTATA GTCATGATAT GAAGTTGTTC AAAATGAGTT GCAGAAATAT CGGACAAGCT 1080
GGAAAAATCT TGGCTGATAG TGGTTATCAA GGGCCCATGA AGATATATCC TCAAGCACAA 1140
ACTCCACGTA AATCCAGCAA ACTCAAGCCG CTAATAGCTG AAGATAAAGC TTATAACCAT 1200
GCGCTATCCA AGGAGAGAAG CAAGGTTGAG AACATCTTTG CCAAAGTAAA AACGTTTAAA 1260
ATGTTTTCAA CAACCTATCG AAATCATCGT AAACGCTTCG GATTACGAAT GAATTTGATT 1320
GCTGGCATTA TCAATTATGA ACTAGGATTC TAGTTTTGCA GGAAGTCTAT TATTTTCCTT 1380
ATTGTCTGTA AGTCTACTGA CCTTGTTGTT TATCCCAGTC ATGGTTTCTA GTTCGGGCTC 1440
AGAGTTTCAA AGTGGATGGC AAGAGCATCA ATTGATTGCT GAGAAGGTTA GTAAAACACT 1500
TGACAAGACA TTTGATAAGG ATGTCAGAAA AATTCCGACC AGTCCAGTTT TATCAAAAAT 1560
TTGTAGATGA GATGGGAAGG ATTTACTC 1588
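
Table 1 lists reverse-direction open reading frames with codons written as they appear on the listed strand, for example -CAT and TCA-. Assuming standard Watson-Crick base pairing, these appear to correspond to ATG and TGA read on the complementary strand. A minimal Python sketch of that conversion follows; the name reverse_complement is illustrative only and is not taken from the specification.

```python
# Standard Watson-Crick pairs; N (unknown base) maps to itself.
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (A, C, G, T, N)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


# Codons as printed for reverse-direction ORFs in Table 1.
for listed in ("CAT", "TCA", "TTA", "CTA"):
    print(listed, "->", reverse_complement(listed))
```

Under this reading, a reverse ORF listed as -CAT ... TCA- starts with ATG and ends with TGA on the opposite strand.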

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



CTGTTTTTAA TACAGCTAAC ATCTCTTCAA AAGTGGTACG CTGAACACCA ACAAGACGCT 60
TAAATCGTGC ATCAGTTAGT TGTTTACTTG CTTCATCATT CATAGAACTA CTATACCATA 120
TTTTGTTTCG CAGGGAGTCT AATATTGTCA AATACTGGAG CGCTCATTGC TGGTATACGG 180
AATAAGATTG GCCCAGCTTC GATAACTGGG ATACCTGGTT CAAAACCAAG ATCTGTTGCA 240
GCGATTGGTG TAAAGATATC GTAACCTTTC ATAAGGTCTT CGTTTACATC TTTCACCATG 300
ACTGCATCAC AGTGAACATC ATAACCACGG TTTGAAAGTT CTTCTTCTAG AGCACTTTTA 360
ATTTGGTGAC TTGAGTTAAC ACCTGCACCG CAGGCAGCAA GAATTTTAAT CATTTAGATT 420
TCCTCCGATT TTATTTTTTA ATAGACAAGA TTAAGCGGTT GCTTCAGCAA TGTAAGCATA 480
AAGTTTTTCT GGTTCGGAAA TTTTTGATAG GTCTTCAAGA TGTCCATTTC CTGTGAAAAA 540
GTCCATCAAC TGAGCCAGAA TATTTGTTTG ACTTGAACTT GAGTTATTGA TGATAAAGAA 600
GAGCAAGGAT ACTTCTACTT CCTTATCAGG AGCTATCATA TTGTGAAAAG TTACTGATTT 660
TTCTAATCGA ACAACCACCA CTTTTCTCAG 690



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:



ACTGCTACCT TTAAGAAGAT AGTAGACGTA TATACTTTTT TAAGAAAATC AAAAAGATAC 60
TATAAAAAAT CTATTTGTTT ATTGAATTTA AGACTTTGGT AACAAATTGA AAATAAAAAG 120
GAGGTATTCA TCATGAATAC AAAAATGATG TCACAATTTT CTGTTATGGA TAATGAAATG 180
CTTGCTTGCG TTGAAGGTGG AGATATTGAT TGGGGAAGAG AAATTAGTTG TGCAGCAGGG 240
GTTGCATATG GCGCAATTGA TGGGTGTGCA ACAACGGTTT GATATTTCTA TTGGGACCAT 300
TTGCTATAGG AATAGGTGTA ACTGGTGCTG CAGGTGGAG 339

(2) INFORMATION FOR SEQ ID NO: 7:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:



CTGAAGACCT CGTGCTACTC CTCCTCCAAC ACCTGCTTTG GCAAAATCTC CCCAATTGCA 60
TCCGCCACCT TCAACTCAAG CAAGCATCTC AGTATCCATA ATTTCAAATT GTGACATCTT 120
TTTTGTATTC ATAACGAATA CCTCTTTTTT ATTTTTAATA TTTGTCTTGT TACAAACTTG 180
ACAAGTTTAG TATAACAGTA TCTATTAATT TTTTTCATCC AAATCTTGAA TTGGCATCGA 240
AACGTCTTGA ATTAGCTTTT TTGTTTCAAA ATCATCTCTA TTTTTAAAAA AGATGTTTTC 300
TAATCACTTT TTTACTATTT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 360
ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 420
GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATATTCT CAACCTTGCT TCTCTCCTTA 480
GATAGCGCAT GATTACAGGC TTTATCTTCA ACTGTTAGCG GCTTGAGTTT GCTGGATTTA 540
CGTGGAAGTT TGTGCTTGAG GATATATCTT CATGAGCCCT TGATAACCAC TGTCAGCCAA 600
GATTTTACCA GCTTGTCCGA TATTTCTCAT TTCTAAAAAC CATCTACTTC CGTTGACTTG 660
GAACTCCGAT TGCTATTTTC CTTGAATGAT TTTAACCACA TCTCCTACAC TTTGGAGTTG 720
GTCAATTTCC TCATCGCTGA TTTCGATACT AAATTCATCC TCCAGCGTCA AGATAAACTC 780
CATCAAATCA ACTGAGTCAG CATCCAAGTC GTCTTTCAGA CTCAAGGATT CTGTCACGAC 840
AAAGTCCTCT CCCTGTCGCT CTTGGATAAT GGTCACAATA CTGTCAAAAA TTTCTTTTTC 900
TCTCATCTCT TTTATTCTCC TGAAAATTCA CGCGCAGTCT GGGCAACTAC TTCTGTTTCT 960
AGCATGGTAC GAATCTGGCG AATCGTACTA TAAACAGCCT TGGCATCGCT TGAGCCATGA 1020
GTCTTGACAA CAGGTGCCTT GACACCAAAC AAGACCGCTC CACCAACATC TGAATAATTG 1080
AGCTGTTTTT TCAAACCTCT GAGGCTGTCC TTGAGAAGGA GGGCACCTAG TTTCGCTCGA 1140
AGACCACCAC CTGTAATAGC TGTCTTGAGC AAGCCCATGA TTCCCATAGC TGTCCCTTCG 1200
ATGGATTTGA GCACAGCGTT TCCCGTGAAA CCATCTGCCA CAACAACATC TGCAACGCCA 1260
TTCATCAAAT CACGCGCTTC CACGTTTCCG ATAAAGTTCA AACTTTCATC AGCCGCCAGT 1320
AATTCATAAG TTTCCTTACG AAGCGGGTCG CCCTTGCTAC TCTCTGTTCC G7TGTTGAGC 1380
AAACCAACAC GTGGTTGCGC AATGCCACGA ACATTCTTGG CATAGAAAGA ACCTAGGACA 1440
GCGTATTGAT GGAGGTGCTG GGCTGTATTT TCTGCATTAG CACCGAGGTC AAGCATGTCA 1500
AAACCTTTCC CATCTACAGT CGGCAATGTT GACATAAGTC CAGGACGGTC GATATTCTTG 1560
ATACGACCCA CGATGAAGAA TCCAGCAGCC AACAAAGCAC CTGTATTCCC AGCCGAAAGG 1620
ACAGCGTCTG CTTCACCATC TTTGACAGCC TTGGCTGCCA ATACCATACT GGCATTTTTC 1680
TTATTCCGAA TAG 1693



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



CTAAATCTGC TTGCTTAGTC CACTTGCTTG AGCCAAGGAG TCACATACTC AAAATTTTCA 60
TGAAAGTCAT AGGTACCGTC TTCTTTTTTA GCTGAAAAGA AAAGTCCATC GTGGTAGAGC 120
AAATCCCCGT GGGCCATAAT TCTGGCAGTT TTTTCCTCTT CCTACTCCTG AGACTTTTGC 180
TTAGTCCCCT CTTGAGAAAT AGTATCTCGT TTTTGACTAG TCAAGGGATT CCTTGGAAGC 240
TTTCAAACAA CAAGACCAAG CCCATTGATA AACCAACTGC TAGCAAGAGT ATCGCCACAA 300
ATCCCTTATT GCTCCACTTG CGATAACTCC TAAAAAGTTT ACCAAGCCCT TCATAAAACG 360
AAAAGCTAAA CCACCCTGAT TTCGATTTTG TCTTCTTTGT ATCTTCGTTC TCCCTACTTT 420
CTTATGCAAG CCTTTTCTTT TTATTATATC ACAGATAAGT ATTTCTTTCA CAATTGAATT 480
GAACTTCCCA TCTATTTTCT ATAAATCCTA AATGCCATAA TGCTTTCAAT TCCTGTCATT 540
TTGTGATATC ATGTAGAAGA AATGAACTAA TCCACAGTGG CTTATTCCAA GTATACCACT 600
TGGGCTTTGG CAGTAG 616



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTAATATAGA ATAATCACCG CCGTTGTGAA AGAACGATTG GATGATAATC CAATCGTTCA 60 

GGGAAATTGG AAGACCTTGG GTTTCCAATT TAGGCATGAG ACACCTTTGG TGGCTGCTGC 120 

CGTCCCTCAC AAGCTAAGGT GATTGTTGAA AAAGAGGAAA AAGGAGAAGA AATGAAACCA 180 

GTAATTTCCA TCATCATGGG CTCAAAATCC GACTGGGCAA CCATGCAAAA AACAGCAGAA 240 

GTCCTAGACC GCTTCGGTGT AGCCTACGAA AAGAAAGTTG TTTCCGCACA CCGTACACCA 300

GACCTCATGT TCAAACATGC AGAAGAAGCC CGTAGTCGTG GCATCAAGAT CATCATCGCA 360

GGTGCTGGTG GCGCAGCGCA TTTGCCAGGC ATGGTAGCTG CCAAAACAAC CCTTCCAGTC 420 

ATTGGTGTGC CAGTCAAGTC TCGTGCTCTT AGTGGAGTGG ATTCACTCTA TTCTATCGTT 480 

CAGATGCCGG GTGGGGTGCC TGTTGCGACC ATGGCTATCG GTGAACTCTT TTTTAGGATA 540 

TAAAACAGGG TTCGGATAAG TTTTTTTGCA AGGTGGATGA TGGCTACATT GTAATGTTTT 600 

CCTTATTCTA ACTTAGTCTT AAGATAGGTC TTAAAACCAG GTGAAAAGCG AAGGCATGCT 660 

TTGGCAGCTT GTATGAGTAC CTACCGCAGA TGAAGGGAAC CCCGTTTGAC CATCCTTCCA 720 

ACTAAATCAA TCTGACCTGA CTGATAAATA GAAGAATCCA GTCCAGCGAA AGCTTGTAAT 780 

TGAGCAGGAT TATCAAAGGC ATGAATATTT CGAATCTCGG CTAAAATGAC CGCCCCTAAA 840 

CGATTCTCAA TCCCAGTAAC CGTCGTGATG ACCGAGTTTA ACTCAGCCAT CAAGTCATTG 900 

ACACATTTTT CCGCCTTGTC AATGAGCCTC TTGTAATGTT TGATGTTTTC ATTACACGAG 960 

ATAAAACGTC TATGCGTTAT CAAACTCATT ACCAATTAAA ACAAATGTGG TTAGATCCTT 1020 

TCGGAAATTG TCAAGCGATT GGAGGAAATG AACTAATCCA CAGCGGCTTA TTCCAAGTAT 1080 

ACCACTTGGG CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG GAGTAAAATG 1140 

AAGACAGTTC AATTTTTTTG GCATTATTTT AAGGTCTACA AGTTCTCATT TGTAGTTGTC 1200 

ATCCTGATGA TTGTTCTGGC GACTTTTGCC CAAGCCCTCT TTCCAGTCTA TTCTGGACAA 1260 

GCGGTGACGC AGCTAGCCAA TTTAGTTCAA GCTTATCAAA ATGGGCAATC CAGAACTTGT 1320

ATGGCAAAGC CTATCAGGAA TTCATGGTCA ATCTTGGCCT GCTGGTTTTG GGTTCTATTT 1380 

ATCTCTAGGT GTAATATAAA CATGTGTCTC ATGACGCGCG TGATTGCAGA ATCGACCAAC 1440 

GAGATGCGCA AAGGTCTCTT TGGTAAGCTT GCTCAGTTGA CGGTTTCTTT CTTTGACCGT 1500 

CGACAAGATG GCGATATCCT GTCTCATTTT ACCAGTGATT TGGATAATAT CCTCCAAGCC 1560 

TTTAACGAAA GCTTGATTCA GGTCATGAGC AATATTGTTT TATACATTGG TCTGATTCTT 1620 

GTCATGTTTT CGAGAAATGT GACGCTGGCT CTCATCACCA TTGCCAGCAC CCCATTGGCT 1680 

TTCCTTATGC TGATTTTCAT CGTGAAAATG GCACGTAAAT ACACCAACCT CCAGCAGAAA 1740 

GAGGTAGGGA AGCTCAACGC CTATATGGAT GAGAGCATCT CAGGCCAAAA AGCCGTGATT 1800 

GTGCTAGGAA TTCAAGAGGA TATGATGGCA GGATTTCTTG AACAAAATGA GCGCGTGCGC 1860 

AAGGCAACCT TTAAAGGAAG AATGTTCTCA GGAATTCTTT TCCCTGTCAT GAATGGGATG 1920 

AGCCTGATTA ATACAGCCAT CGTCATCTTT GCTGGTTCGG CTGTACTTTT GAA 1973 
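
Each row of the listing ends with a running base count, which allows a transcription to be checked against the LENGTH stated under SEQUENCE CHARACTERISTICS. The following is a minimal Python sketch of such a check, assuming rows of space-separated base groups followed by the running count; the function name join_listing_rows is hypothetical, and the sample rows are the first two rows of SEQ ID NO: 9 as printed above.

```python
def join_listing_rows(rows):
    """Concatenate listing rows, checking each trailing running count."""
    sequence = ""
    for row in rows:
        # Everything but the last field is sequence; the last is the count.
        *groups, count = row.split()
        sequence += "".join(groups)
        if len(sequence) != int(count):
            raise ValueError(
                f"running length {len(sequence)} does not match listed {count}"
            )
    return sequence


# First two rows of SEQ ID NO: 9 as printed in the listing.
ROWS = [
    "CTAATATAGA ATAATCACCG CCGTTGTGAA AGAACGATTG GATGATAATC CAATCGTTCA 60",
    "GGGAAATTGG AAGACCTTGG GTTTCCAATT TAGGCATGAG ACACCTTTGG TGGCTGCTGC 120",
]

sequence = join_listing_rows(ROWS)
print(len(sequence))
```

A row whose groups were split or dropped in transcription makes the running length disagree with the printed count, so the error points at the first affected row.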

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTAAACCAAT AGTTGCTGAA GACCATAATC CAGCTGCTGT TGTTAAACCT TTAACTTTTT 60 

GATTAGTTAC CATGATTGTT CCTGCCCCTA GGAAACCCAA GCCACTAATT ACTTGGGCTC 120 

CCATTCGACT AGGATCACCG CTACCATAAC GACTAGTGAT GAACTGATTT GTCATCATTA 180 
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CAACACAAGT 


TCCCAAACAA 


ACTAGTAAGT 


AGGTTCTAAT 


CCCTGCTGCT 


TGGTTTTTGA 


240 


CTCCTCGCTC 


ATAGCCAACA 


ATGC C ACCGA 


AAAGAATCGC 


TAAAAAGCAC 


CTAAGAAGTA 


300 


TTTCCCAAAT 


ACTCAGTTCG 


TATGAAAGAT 


TCATATTATC 


TCTTACCTCG 


.TTTACCTTGG 


-yen 


AATAGGCTTG 


ATAAATAAAG 


AGC TGCACTA 


GACATAATCA 


TTAAAATTAC 


ACjAATAAACA 


420 


AACATCATTG 


CCTGTGCATT 


TAAAGTTGCT 


GTTTCATCAG 


TAGACTGTTT 


AATAACGATT 


480 


CCCAATGGTT 


GGAAAAGTGG 


ATGGTACAAG 


AATACAGATA AGTCATAGTC 


AVjATAATAAA 


540 


GAATTAAAGT 


TTAACACAAC 


TACTGATAAT 


ACGACTGGTA 


r\f\f\. in! M^VjVj 


i ATAATAACT 


600 


CGCACCATAG 


TGTAGAAACT 


AGATGCACCC 


ATACTACGTG 


CAGCTTCTTC 


CATATCATTA 


660 


TCTATACTGA 


AAAATACAGC 


CCGAATCATT 


CTATAAGAAA 


ATGGTAATTT 


TTGAATAGTA 


720 


TATGCAATAA 


GTAAAATAAT 


TACTGTTCCT 


ACTAAAACTA 


AATTAAATAG 


TATTAAATGA 


780 


GGTATATTAT 


AAGTGAACAT 


TAATCCTAGT 


GCAATCAATG 


TACCTGGCAA 


TATCCATGGT 


840 


ATCAGTGCAC 


CATACTCAAA 


uftrt i x i n. i V-Vj 


AACTTACTCT 


TGTTTTTATG 


TACAATACGT 


900 




x X Vjv- 1 x\ 1 /in 1 




ACCGCAGCTA 


AAATTGCATA 


AACAACGCTG 


960 


ACTAGGTAAG 


GACGAAATGA 


TTGAGCATCT 


GTAAATAAAT 


TAGCATAGTT 


CGCTAACGTA 


1020 


AATTTAGATA 


GATCTAAAGT 


TCCCGTCTTG 


ATCGTCAACG 


AGTCTGTAAA 


CGAGTATAGA 


1080 


ATTATCAAAA 


CTATTGGTAA 


CATATAGATT 


GCAAACAATA 


CATACGCAGC 


AATGTGAGCA 


1140 


AGAATTATTC 


CATAATGGAG 


AAGAAAATTA 


TCTGCTTCCT 


TAATACTAGC 


CTGGTTTTCG 


1200 


AAACAGAAT 












1209 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



CTAAGTCCTT TTAGTTTTAT CTTAATTCTC TTATTGTTGT AATAATCAAT ATAGTCTATA 60
ATGGCTTGTT CCAATTGCTT AAGCGACTGA AACGACTTCT CATAACCGTA AAACATTTCC 120
GATTTCAGAA TCCCAAAGAA AGATTCCATC ATACCGTTGT CTTGGCTGTT TCCCTTGCGT 180
GACATAGATG CTTGAATTCC CTTACTCTTA GGAACCGATG ATAAGAATCG TGTTGGTATT 240
GCCAGCCTTG GTCACTATGG AGAATCGTAT TCTCGTAGTG CTTCTCTGTG AATGCCTGTT 300
CCAACATTGT TTGTACTTGT TCTAAGTTGG GTGAAGTTGA AAGATTATAG GCGATAATTT 360
CGCTATTAAA GCCATCTAAA ACTGGTGATA AGTAAAGCTT TTGAGTACTT GCTGGAATGG 420
CAAATTCTGT CACATCTGTG TAGCACTTTT CCATTGTTTT AGAGCCTTCA AATTGGCCTT 480
GAATGAGATT CTCTGCCTTC TTACCAACGT CTCCTTTATG AGAAGATTTC GTTTCTGTCG 540
CATTTTAGCT TGTAAATTGA GTACTTTCAT CAAGCCTTGA ACTCTTTTAT GATTTACCAG 600
ATAAGCACGA TTCCTTAGTT CTAAATGAAT ACAGCGATAA GCATAATTTC CCTTGTGTTC 660
GATAAAAATG GATTGAATTT CAGCTTTAAG CTCTTGGTCC TTATCTGGTT TGTNTAG 717






WO 97/37026 

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 851 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

CTAAATCACC AGATTTAACC AAGTCGTAGA TACGGCCTGG TGTTCCAATA ACAATATGAG 60
GCTGATTGCT TGCCAATTTC TCAATCTGGC GAGCCTTATC CGTACCACCC ACATAATTAA 120
CCACACGAAC TTCGACATCT GAGTGAGCTG AAATCTGACG CGCTACTTGG TAAATTTGAG 180
TAGCCAACTC ACGACTCGGT GCAGTAATCA CTGCTTGTAC ACTATCGCTA GCTTCATCTA 240
ATTGCTGGAA AATCGGTAAC AAGAAAGTAT GAGTCTTACC TGAACCTGTT TTTGATTCTC 300
CTACTAGGTC ACGACCTGCC AAAACAATAG GAATCAACTT GTCTTGCACC TCTGTTGGAG 360
TTGTAAATTT TAACTCCTCC AAGGCTTCTC TAATATAGTT TTTAAATTGA AATTTCGTAA 420
ATGACATAAC ATCCTCGATT CTATCTATCT TATCAATTAT ACCATATTTT ATTCCATTAC 480
AGTAGTCTCA CTTATTTAGG CTATTTCCAG TAGCTTCTCT AGTAAGAAAA GGCTGGAATT 540
TTATAGTTCC AACCTCTTTT CAGTTATTAT TTCCAGTTTA ACATAGCATT CAAGCCATAG 600
TGATCACTCA CTTGTGGACT CTTGTTACCA TCAAATACGA CATGTAAATT TTCCACCGCT 660
AACTCTTTGG TAGTAAAGAC ATAATCGATT CGAAGGGGTT CAGTGTTCCC TTTCCAGCCA 720
TCAATTTCAG GCGGAACAGT ATAGCTACCA CTTTTCTCTT GAGCAACTTC AAATGCGTCT 780
TGTAAGCCTA ATGGACTAGC TAAAATAGCT TGGTAACCTT CCCTCACCTG CTGGGTTGTT 840
AAAATCTCCA G 851

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 541 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:



CTG^TGAGAT TGTTACGAAA TAACTGAAAA TATTTAAGGA GAAAATATAT GTTGAAACAC 60
TTAAACTTAA AAGGTCACTT ATTGACAGCC ATTTCCTATA TGATTCCAAT TGTTTGTGGT 120
GCAGGATTCT TAGTTGCCAT TGGTTTAGCA ATGGGGGGTG GTGTTCCTGA C^^S 180
GCAGGAAAAT TCACTATCTG GGATGCTTTA GCAACTATGG GTGGTAAAGC CC^^ 240
TTGCCAGTTG TTATTGCTAC AGGTTTGTCT TACTCGATTG CTGGTAAGCC AGGGATTGCA 300
CCAGGTTTTG TTGTTGGTCT AATTGCCAAT TCTGTTGGTT CAGGGTTTAT CGGTGGTATC 360
TTGGGAGGTT ATATAGTTGG TTTCTTGGTT CAAGCGATTA TTAAAAAGGT CAAAGTACCA 420
AACTGGATTA AAGGTTTAAT GCCAACCTTG ATTATTCCTT TTGTACCTCT TTGGTAAGTA 480
GTTTGATTAT GATTTATATT ATTGGGGCGC CTATCGCAGC CTTTACCAAC TGGTTGACGA 540
G 541

PCT/US97/05306



(2) INFORMATION FOR SEQ ID NO:14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CTGGAATGAG TGTATAAGCC CAGCCCAAGT TTTGCATCCG TTCAAAGTTC CAAGACCCTT 60
GTAAGAAGGT TGAACGCCAC CAAACTTTTT TACGATCTGA TTTAGTTAAT TGAAGTTTTT 120
CAGTCATGAT GTTTTCAGTC CTTTCTTATC TTAGTAGTCT TCTAGGATAT CGCCGATTGG 180
GTCGTTAGAA GTTGCGGCTC CTCCGCCACC ATTTCCACCA GTTTTAGAAA GGTGAAGGTA 240
GATAAGAGCG ATAGCAACAC CGATAGCACC GAATCCGATT AGAGTAATAT CTGACACAGC 300
AGCGAGAACG AAACCAAGAG CGAAGAATGG CCATACTTCA CGAGTTGCCA TCATGTTGAT 360
AACCATGGCG TAACCAACGG CAACGACCAT ACCACCACCG ATAGCCATAC CATCTTTGAG 420
CCAGTCTGGC ATGGCACTAA GGATACTTTG TACAGTTTCA GTTGGTACCA TAAGGAGAAG 480
AGCTGCAGGA AGCGCGATAC GAAGTCCTTG GAAAAGTAGC GCAATGAAAT GCGCACGCTC 540
CACAGCGCCG AAGTCACCTT TTTTAGCGGC AGCATCTGCA GTATGAACCA AACCAACTGA 600
AATTGTACGA ACAATCATTG TCAAGAAAAG TCCAGCTACG GCAAGAGGGA TAGCAACCGC 660
TTGGGCAACA CCGATACCAG TCTTGGTAAA GTCACCACCA AGAACCATGA TAATGGCAGC 720
AGCGACAGAA GCAAGTGCAG CATCAGGAGC GATAGCAGCA CCGATATTTG ACCAACCAAG 780
GGCAATCATT TGAAGCGATC CACCGAGGAT AATCCCTGCT TCCAAGTGGA CCTGTTACAA 840
GCCCAATAAG GGTACAGGCT ACAAGTGGTT GGTGAAATTG GAACTGGTCG AGGATGCCTT 900
CAAGACCTGC AAAGGAAGGC TACAACGACT ACTAAAACCA TAGAAATAAT AGACATGTTT 960
AAAATCCTTT CATAAATAAT GGCTTATTTG ACATTGGCTT TGTTAATCAA GTCAAACAAA 1020
TCTTTTTTAG AATCATTTGG TACTTTACGG ACATCAAATT CAACACCCAA GTCACGCATT 1080
TTTTCAAATG TAGCAACATC TTCTTTGTCC ATAGACAAAA CGGTATTGAC CAATGTTTTA 1140
CCTGTTGAGT GAGCCATAGA ACCAACGTTA AGAGTCTTGA TTGGCACGCC GCCTTCGATG 1200
GCACGAAGGG CATCTTGAGG TGTTTCAAAC AAGATAAGGG CATGTGTTTC TCCAAAACGT 1260
GGGTCTTTTG AAATATCAAT CAGTTTTTGA ATTGGAACCA CGTTAGCCTT GACATTACCT 1320
GGAG 1324



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 663 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

TCATAAGGAA GCTGTCGCTC GTTCCGCTAA GGTATGGACA CCACGGTGAA CATTGGCATT 60
GTCCTGCTCA TAGTAACTGT TAATAGCTTT CAGAACTACT AGTGGTTTTT GTGTCGTCGC 120
AGCATTGTCC AGATAGACCA GAGGTTCATC ATTGACAATC TGATCTAAAA TTGGAAAATC 180
CTTGCGAATC GCTTCTACAT CTAACATAGG CTTCCCCTTA GCGTTTTGAC AATTTCTCTT 240
CGATAGTTGC AATCATTTCA TCACGAACTT CCTTGACTGG AATCTCCACG ATAACAGATC 300
CAAGGAAACC ACGAACAACC AAACGCTCTG CAGTTGCCTT ATCCAATCCA CGACTCATGA 360
GGTAATACAT GTCTTCTGGA TCAACTTGTC CGATAGACGC TGCGTGTCCT GCAGTGACAT 420
CATTTTCATC AATCAAAAGA ATTGGGTTAG CATCTGAACG CGCTTGGTCT GAAAGCATGA 480
GAACACGGCT CTCTTGTTGC GCATCTGCTC CCTTAGCACC CTTGATGATG TGGCCGATAC 540
CATTGAAAGT CAAAGTTGCT TTTTCAAGGA TAACCCCATG TTGTAGGATA TTTCCGATAG 600
AGTTGCAGCC ATAGTTAGTT ACACGAGTAT CAATCCCTTG TACCTGACGA CCACTTGAAA 660
GAG 663

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 649 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

CTAGTTGGAT GGCTTCAATA AAGGATGATT TGGCTGCTCC ACTATTGGCA ATGAGCTGAA 60
AACAGATATA TTCCATTTCT TCGTCATCTT ATTTCTCCTA TCCATTCAAG TGCTTGTTCC 120
AGAACTTTTG CTCCATTCAT CATTCCGTAA TCCCGCATAT CAATGGTATC TACAGGGATA 180
TTTCCTGCAA TTTCTTTCAC AGCAAGTAAC TCATAACGAA TTTGTGGCCC AATTAGAATG 240
ACATCTGCTT CATGGATATT CTTTTTAGCT TCTGTCATTG ATTTTGCTTG GATAGAAATT 300
TCAATCCCAC GTTCAGTCGC ACTTTGTTGC ATTTTTTTAA CAAGCATACT TGTCGACATT 360
CCCGCATTAC ATACTAATAA AATTTGTTTC ATAATCTTAA CCTTCCATTT CTTGTTCAAC 420
AACTTTGTCA TTAACTTTGA TAAATGGAAT GTATAGAAGA ACTCCAAGTG CAAAGATGAT 480
GAATTGAACT AGAACTGCTC TCACGTCCCC TGCTGTTGCT AACCATGCAT TTAAGAATAC 540
TGGTGTAGTC CAAGGAACTT GTATAAATGC AGGACTCATG AATCCTGTAA CTGTTGCTAA 600
GTAGCTGATT AAAATACCAA GGACTGGAAC TGTGATAAAT GGAATAGTC 649

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CTTGGTATTA TTTGAAATCA GATGGTTCTT ATGCAAAAAA TGCATGGCAA GGAGCTTATT 60
ACCTTAAATC AAACGGTAAA ATGGTACAAG GTGAGTGGGT TTATGATTCT TCTTACCAAG 120
CCATGGTATT ACTTGAAATC AGATGGTTCA TATGCTCGCA ATGCATGGCA AGGAAACTAC 180
TATTTGAAAT CAGATGGTAA AATGGCTGTC AATGAATGGG TTTATGATGC CACCTATCAA 240
GCATGGTATT ATTTGACATC AGATGGTTCT TATGCTTACA GTACATGGCA AGGAAATTAC 300
TATCCTAAAA TCGGATGGTA AAATGGCTGT CAATGAATGG GTTGATGGTG GACGTTATTA 360
TGTTGGCGCT GACGGAGTTT GGAAGGAAGG TCAAGCAAGT ACAGCTTCTC CTAGTAATGA 420
TAGCAATAGT GAATATTCCT GCTGCTTTAG GAAAGGCAAA AAGTTATAAT TCGTTATTCC 480
ACATGTCAAA AAAAACG 497



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

CTAGTCTAAA TTTTTTAAAA CAAAGGTCAA AGATAGTCAA TATCAGTAAT CATAACTAAG 60
TAAACAAAAA GAGGTAAAGA ATATGAATAA CAACTTTAAT AATTTTAATA ACATGGATGA 120
TTTATTTAAC CAATTGATGG GTGGTATGCG AGGATACAGT TCTGAAAATC GCCGTTACTT 180
AATTAATGGA CGCGAAGTCA CACCTGAGGA ATTTGCTCAC TATCGTACGA CTGGTCAATT 240
ACCAGGAAAT GCAGAAACTG ATGTGCAAAT GCCACAACAG GCATCAGGTA TGAAACAAGA 300
CGGTGTCCTT GCAAAACTAG GTCGAAACTT GACAGCAGAA GCGCGTGAGG GCAAGTTGGA 360
TCCTGTTATC GGACGAAACA AGGAAATTCA AGAAACATCT GAAATCCTCT CACGCCGCAC 420
CAAGAACAAT CCTGTTTTGG TCGGAGATGC AGGTGTTGGT AAGACAGCAG TTGTCGAAGG 480
TCTAGCGCAA GCCATTGTGA ACGGAGATGT TCCTGCTGCT ATCAAGAACA AGGAAATTAT 540
TTCTATTGAT ATCTCAGGTC TTGAGGCTGG TACTCAATAC CGTGGTAGCT TTGAAGAAAA 600
TGTCCAAAAC TTAGTCAATG AAGTGAAAGA AGCAGGGAAT ATTATCCTCT TCTTTGATGA 660
AATTCACCAA ATTCTTGGTG CTGGTAGCAC TTGTGGAGAC AGTGGTTCTA AAGGGCTTGC 720
GGATATTCTC AGCCAATCGA TCTCTCTCGT GGAGAATTGA CAGTGATTGG GGCAACAACT 780
CAAGACGAAT ACCGTAACAC CATCTTGAAG AATGCTGCTC TTGCTCGTCG TTTCAACGAA 840
GTGAAGGTCA ATGCTCCTTC AGCAGAGAAT ACTTTTAAAA TTCTTCAAGG CATTCGTGAC 900
CTCTATCAAC AACACCACAA TGTCATCTTG CCAGACGAAG TCTTGAAAGC AGCGGTGGAT 960
TATTCTGTTC AATACATTCC TCAACGTAGC TTGCCAGATA AGGCTATTGA CCTTGTCGAT 1020
GTAACGGCTG CTCACTTGGC GGCTCAACAT CCAGTAACAG ATGTGCATGC TGTTGAACGA 1080
GAAATCGAAA CGGAAAAAGA CAAGCAAGAA AAAGCAGTTG AAGCAGAAGA TTTTGAAGCA 1140
GCTCTAAACT ATAAAACACG CATTGCAGAA TTGGAAAGGA AAATCGAAAA CCACACAGAA 1200
GATATGAAAG TGACTGCAAG TGTCAACGAT GTGGCTGAAT CTGTGGAACG AATGACAGGT 1260
ATCCCAGTAT CGCAAATGGG AG 1282

(2) INFORMATION FOR SEQ ID NO: 19:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CTTGATATAA TTCTTGTTTC GAAAGATTCT CCTTAGGTAT ATCTATTCCT CCACTAGTAA 60
ACGGTAATTC CAAAACAGAG TTTACTTCGT TAAATGTAAG CCAATATTTA ACTTTATCTT 120
TATACCTTTC TAAAACTGTT CGAGCAAATT TTTCATAAAA ATGAATCATT CTTCCTATCA 180
ATCCATCCAT GATATTTTCT TGCTAAATAT AATGGAGTCT CATAGTGTGA AAGAGTTACA 240
AGTGGTTCTA TCCCGTGAGC ATGTAGTTCA TCAAACAATT CATCATAATA TTTCAATCCA 300
GCTTCGTTAG GTTCTTCCTC ATCTCCTTTT GGAAAAATTC TACTCCATGC AATAGAAGTA 360
CGAAAAACAT TAAAGCCCAT TTCAGAAAAC AAGGATATAT CTTCCTTATA TTTATGATAA 420
AAATCAATAC CTATCAATTT TAAGTTATCT TCTGTAGGAT TTTCTGTTGC TTCTCCTAAT 480
CCACCTTTGG GTAACACATC CTGAACTGAT AAGCCCTTAC CATCTTCATT ATATGCTCCC 540
TCTACTTGAT TAGCTGCAAC AGCTCCACCC CAAAGAAAAT CATCTGGAAA AATGGTCATA 600
ACTTTCCTCC ATTATAATAT TACCAGTAAT TCCTTAGAAA TGCTCGATTG TCTGATTATT 660
AGGTAATATT AATACATCTA GAAAATCATT GGTATTCGTT ACAATTACTG GTGTAACTGT 720
TTCGTAGCCT TTAGTCTTGA TTAAATTCAA GTCCATTTCA AAAATCAACT GATTTTTGAA 780
AACTCTGTCT CCTTCTTCTA CATGACTAAT AAAACCTTGA CCTTTTAG 828

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1466 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



AGCTATGGTC AGACTCAGAT TGATGGCGTT GCTTATGCCA AGTACGATAT CTTCCGTTTA 60
AAGAACGGGA AAATTGTGGA GCATTGGGAT AATAAGGAAG TCATGCCTAA GGTAGAAGAC 120
TTGACCAATC GAGGGAAGTT TTAAATTGAG GACAAAGAAT GATTGAATAC AAAAATGTAG 180
CACTGCGCTA CACAGAAAAG GATGTCTTGA GAGATGTCAA CTTACAGATT GAGGATGGGG 240
AATTTATGGT TTTAGTAGGG CCTTCTGGGT CAGGTAAGAC GACCATGCTC AAGATGATTA 300
ACCGTCTTTT GGAACCAACT GATGGAAATA TTTATATGGA TGGGAAGCGC ATCAAAGACT 360
ATGATGAGCG TGAACTTCGT CTTTCTACTG GTTATGTTTT ACAGGCTATT GCTCTTTTTC 420
CAAATCTAAC AGTTGCGGAA AATATTGCTC TCATTCCTGA AATGAAGGGG TGGAGCAAGG 480
AAGAAATTAC GAAGAAAACA GAAGAACTTT TGGCTAAGGT TGGTTTACCA GTAGCCGAGT 540
ATGGGCATCG CTTACCTAGT GAATTATCTG GTGGAGAACA GCAACGGGTC GGTATTGTCC 600
GAGCTATGAT TGGTCAGCCC AAGATTTTCC TCATGGATGA ACCCTTTTCG GCCTTGGATG 660
CTATTTCGAG AAAACAGTTG CAGGTTCTGA CAAAAGAATT GCATAAAGAG TTTGGGATGA 720
CAACGATTTT TGTAACCCAT GATACGGATG AAGCCTTGAA GTTGGCGGAC CGTATTGCTG 780
TCTTGCAGGA TGGAGAAATT CGCCAGGTAG CGAATCCCGA GACAATTTTA AAAGTGCCTG 840
CAACAGACTT TGTAGCAGAC TTGTTTGGAG GTAGTGTTCA TGACTAATTT AATTGCAACT 900
TTTCAGGATC GTTTTAGTGA TTGGTTGACA GCTACAATGA CATTGGTCGG TTCCTTGAGC 960
AAGAGATAGA TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG CATAGACCGT 1020
TGCACATTAA GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC TGCTTGTAAT 1080
AAAAAGGTTG TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG TTCCTGCCCC 1140
TAAAGCGAGG GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT TTATGTGGTT 1200
GGTCATAATA TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG GCATGACCGC 1260
ACCAGCTATA ATCAAATCCT GCCCGTTTGA ATGGAAAAAC CCTGTGTTAG CCGAGCCCCA 1320
AAACTGGGGC CAATTTATCC CCCAAAGACA AAAGCTCCAT CAAAGGCTGT CACAAAGGGA 1380
ATTCGGATAA ATTTTCCACA TAGAAGGAAA AGGCAAAACC AAATAAGGTC GCCACTCCTG 1440
CCCCAAGTGC TCGTAAATAT TCCGCT 1466



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

AAGGTTCTTC GTAAATGTAC ACCATTCGGA TGCTCAAAAT GGGTTCTGAA GCTGCTGCTA 
AATCTGCACA AGAACACGGT CTTAAATCAG TTGAAGTTAC TGTAAAAGGT CCAGGTTCTG 
GTCGTGAGTC AGCTATTTCG TGCGCTTGCT GCCGCTGGTC TTGAAGTAAC AGCAATTCGT 
GATGGGACTC CAGTGCCAAC ACAATGGTGC TTCGTCCTCC AAAACGTCGC CGTGTATAAT 
CATCGCATTA CACTGCTTTT CGTTTAAGAG GGAGTAACTA AATGATCGAG TTTGAAAAAC 
CAAATATAAC AAAAATTGAT GAAAATAAAG ATTATGGCAA GTTAGTAATC GAACCACTTG 
AACGTGGCTA CGGTACAGCT CTTGGTAACT CTCTTCGTCG TGTACTTCTA GCTTCTCTAC 
CAGGAGCAGC TGTGACATCT ATCAACATTG ATGGTGTGTT ACATGAGTTT GACACAGTTC 
CAGGTGTTCG TGAAGACGTG ATGCAAATCA TTCTGAACAT TAAAGGAATT GCAGTGAAAT 
CGTACGTTGA AGACGAAAAA ATCATCGAAC TGGATGTTGA AGGTCCTGCT GAAGTAACAG 
CTGGTGACAT TTTGACAGAT AGCGATATTG AAATTGTAAA TCCAGATCAT TATCTCTTTA 
CAATTGGTGA AGGTTCTTCT CTAAAAGCGA CTATGACTGT TAACAGTGGT CGTGGATATG 
TACCTGCTGA TGAAAATAAA AAGGATAATG CACCAGTTGG AACACTTGCT GTAGATTCTA 
TTTATACACC AGTTACAAAA GTCAACTATC AAGTGGAACC TGCTCGTGTA GGTAGCAATC 
ATGGTTTCGA CAAATTAACC CTTGAAATCT TGACAAATGG AACAATTATT CCAGAAGATG 
CTTTAGGGCT TTCAGCACGT ATTTTGACAG AACATCTTGA TTTGTTTACA AATCTTACTG 
AGATTGCTAA GTCAACTGAA GTGATGAAAG AAGCTGATAC TGAATCTGAC GACCGTATTT 
TAGATCGTAC GATTGAGGAA CTGGACTTGT CTGTGCGTTC ATACAACTGT TTAAAACGTG 
CCGGTATCAA TACTGTGCAT GATTTGACAG AAAAATCTGA AGCAGAGATG ATGAAAGTAC 
GAAATCTTGG ACGCAAGAGT TTGGAAGAAG TGAAACTCAA ACTCATTGAT TTGGGTCTTG 
GATTAAAAGA TAAATAAAGG AGGAATACAT GGCTTACCGT AAACTAGGAC GCACTAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CTATTGAAAC AAGAAAAATA GAGAATCAAA GAAAGAGAAC TTATGAATAT TCAAGAAGAA 
ATTAAGAAAC GTCGTACCTT TGCCATTATC TCCCACCCGG ACGCGGGGAA AACAACCATC 
ACTGAGCAGT TACTCTAACT TTGGGGGTGA GATTCGTGAG GCTGGTACGG TAAAAGGGAA 
GAAAACAGGG ACTTTTGCTA AATCTGACTG GATGGATATC GAGAAGCAAC GTGGGATTTC
TGTTACTTCA TCTGTTATGC AATTTGACTA CGACGGCAAG CGCGTGAATA TCTTAGACAC 300
GCCAGGGCAC GAGGACTTCT CAGAAGATAC CTATCGTACC TTGATGGCGG TGGATGCTGC 360
GGTCATGGTC GTGGACTCTG CCAAGGGGAT CGAGGCTCAA ACAAAGAAAT TGTTTGAGGT 420
TGTGAAACAT CGTGGCATTC CAGTCTTTAC CTTTATGAAC AAGTTGGATC GTGACGGTCG 480
TGAGCCTTTG GATCTCTTGC AAGAATTGGA AGAAATCTTG GGCATTGCTA GCTACCCTAT 540
GAACTGGCCT ATCGGGATGG GGAAAGCCTT TGAGGGCTTG TATGACCTCT ATAACCAACG 600
TTTAGAGCTT TACAAAGGGG ATGAGCGTTT TGCTAGCCCT AGAAGATGGA GACAAACTTT 660
TTGGTAGCAA TCCTTTCTAC GAGCAAGTCA AGGATGACAT TGAGCTTTTA AATGAAGCTG 720
GGAATGAGTT TTCAGAGGAA GCTATTCTGG CTGGAGAATT GACGCCTGTC TTTTTCGGTT 780
CAGCCCTGAC AAACTTTGGT GTGCAGACCT TCCTTGAAAT CTTCCTCAAG TTTGCTCCAG 840
AACCACATGG TCACAAGAAA ACAGACGGTG AAATTGTGGA TCCTTATGAC AAGGATTTCT 900
CAGGCTTTGT CTTTAAAATC CAAGCCAACA TGGATCCTCG TCACCGTGAC CGTATTGPCT 960
TTfJTP PfiT AT CGTATCAGGC GAATTTGAGC GTGGCATGAG TGTCAATCTC 1020
GTAAGGGTGC CAAACTATCT AATGTTACCC AvjTTTATGOC (jGAGAGTCG T GAGAATGTGA 1080
CCAATGCCGT AGCAGGTGAT ATTATCGGGG TTTACGATAC CGGTACTTAT CAGGTTGGGG 1140
ATACCTTGAC GGTTGGAAAA AACAAGTTTG AATTTGAACC ACTGCCAACC TTTACTCCTG 1200
AAATTTTCAT GAAAGTTTCT GCTAAGAATG TTATGAAGCA AAAATCCTTC CACAAGGGGA 1260
TTGAGCAATT GGTGCAAGAA GGAGCCGTTC AGCTTTATAA GAATTACCAA ACAGGTGAGT 1320
ACATGCTGGG AG 1332



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



TTGGACGCTG GAACAAATCC AAGCTGACCT GTTCCAAGAC CAGACTTGGT ATGCTCTGGC 60
TTATGATGGG GCAGAAGTGA TTGGCTTTCT AGCTGTTCAG GAGACTCTTC TTTGAAGCAG 120
AAGTCCTGCA AATCGCTGTC AAAGGAGCCT ATCAGGGTAA GGGCATTGCG TCAGCCTTGT 180
TTGCTCAATT GCCGACAGAC AAGGAAATTT TCCTCGAAGT CAGACAGTCA AATCAACGAG 240
CGCAAGCATT TTACAAGAAA GAAAAGATGG CAGTTATCGC TGAGCGAAAG GCCTACTACC 300
ATGACCCAGT CGAGGACGCC ATTATCATGA AGAGAGAAAT AGATGAAGGA TAGATATATT 360
TTAGCATTTG AGACATCCTG TGATGAGACC AGTGTCGCCG TCTTGAAAAA CGACGATGAG 420
CTCTTGTCCA ATGTCATTGC TAGTCAAATT GAGAGTCACA AACGTTTTGG TGGCGTAGTG 480
CCCGAAGTAG CCAGTCGTCA CCATGTCGAG GTCATTACAG CCTGTATCGA GGAGGCATTG 540
GCAGAAGCAG GGATTACCGA AGAGGACGTG ACAGCTGTTG CGGTTACCTA CGGACCAGGC 600
TTGGTCGGAG CCTTGCTAGT TGGTTTGTCA GCCGCCAAGG CCTTTGCTTG GGCTCACGGA 660
CTTCCACTGA TTCCTGTTAA TCACATGGCT GGGCACCTCA TGGCAGCTCA GAGTGTGGAG 720
CCTTTGGAGT TTCCCTTGCT AGCCCTTTTA GTCAGTGGTG GGCACACAGA GTTGGTCTAT 780
GTTTCTGAGG CTGGCGATTA CAAGATTGTT GGAGAGACAC GAGACGATGC AGTTGGGGAG 840
GCTTATGACA AGGTCGGTCG TGTCATGGGC TTGACCTATC CTGCAGGTCG TGAGATTGAC 900
GAGCTGGCTC NTCTNGGGCA GGANATTTAT GA 932

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1158 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

CTAAACTTTC GCTCATAGGC ATACAAATTA ATCCTTTGGC ATAAGTAGCC ATAAAATTAA 60 

CATTTTCTGT TGTAGCTGCT TGTGCAGAAC AAATTAAGTC TCCTTCATTT TCTCTATCCT 120

TGTCGTCTAT AACAAGAACA AGTCGTCCCT TCTGCAATGC TTCTAATGCT TCTTGTATTT 180 

TTCGATATTC CATTGACTGA TTATCCTTTC TGCTAAAATC CATTTTGATA TAATAGTTCC 240 

TTCGATATTT CTGATTTTGG AGAGTTATCC ATCAGTTTTT GCACATATTT ACCTAAGATA 300 

TCATTTTCAA GATTTACTGT ACTCCCGACT TGTTTACTCT TAAGAATGGT TTGTTTCAAG 360 

GTATGAGGGA TAACAGATAC TGAAAAGTTT ACTTTGGAGA CTTTAGCGAC AGTCAGACTA 420 

ATGCCGTCAA TTGTAATAGA TCCTTTTTCA ACTATTAAAT CTAAAATTTC TTTTTGTGTG 480 

TTGATTTGAT ACCATACAGC ATTATCATCT TTTTTTATTG ACGAGATTTT TCCTGTACCA 540 

TCAATGTGTC CTGTAACGAC GTGACCCCCA AGTCGACCGT TGACAGATAA GGCTCTTTCT 600 

AGATTCACCT CACTTCCATG TTTTAATAGA GTAAGAGCTG TTCGACTCCA TGTTTCATTC 660 

ATTACATCAA CTGTAAAGGA TTGATGATTG AAATGAGTAA CTGTAAGACA GATACCATTT 720 

ACTGCTATAC TATCGCCTAA ATGGATATCC GTTAATATTT TTGAGGCTTT AATTGATAGT 780 

TTACAATTAC GAGAGTCTTT CTGTATTCCT TCAACTTTTC CGATTTCTTC AATTATTCCT 840 

GTGAACATGG ATAAATCACT TCACTTTCTA TGAGATAGTC NTTTCCTNTT TGAGAAAAAG 900 

CATAAGGTTT CAATCTAATA GCGTCATTTG GCAAAGAAAT GCCTTCACCT CCGACAGGAA 960 

ACTTGGCACT GCCTCCAAAA ATTTTTGGTG CAATATATAT TTTCAGCTCA TCAACAATTT 1020 

GTTGTTCCAA AGCACTCCAA TTCATTAGAC TGCCCCCCTC TACAACTAGG CTATCAATCT 1080 

GCATGTTTCC TTAGATGTTN GCATTAAACT CNGATAAGTC TATATGATTG CCTTTTTTCT 1140 

TTATGGAAAG TATTCCCC 1158

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:



TAAAGGCTGT TTGGATAATG TAACATAAAG AAATCTTGTT TAGCAGGGAT TGAATAGAGT 60
TCCTGAAGTG ATGACTTCTT TATTAGACAT AATGTCTGGA CCCAAGATTA AAAGAATGGC 120
CCCGAAGAAT ACGAGCATCA AAGGTAGCAG ATGCAACAAC TGTATCGTGG AAGATTGAGA 180
GGAACTTAGG TAATTTAAGA TTGTCTAAAC TTTCTTCTTT CTTACCAAAG CGTCCTGCTA 240
CTTTATCTAC AAACCAGATT GCAAATTGCT GTTGGTGACC AATCGCAAAT CCGCCACCAC 300
CAGTCAAGCG TTGAGTTGCC TCAACAGTCA TATTTGAACT AACTGCCCAG TAAAGTCCAC 360
AGATGATACC AATCGCTGCT GTACCGTAAG CATTGCGCAA TTGTGGTACT AAGAATAGAA 420
CCATAAGAGA TACTGTTGCA GCTAGCCATG CTGATCAATT CCCTCAACCA GTCCTATATC 480
TCAGCCAAGA GAATCGAGGA GGTCTTTGCG GAATCTCCCG AAAACATCCA TTCAGAATTA 540
GAACAAAAGC AAGTTACCAG TGGTCGGGTT TTACAAGTCC AAGAATTGAC TTTTACCTAT 600
CCTGATGCGG CCCAGCCTTC TCTGAGAGAC ATTTCCTTTG ATATGACTCA AGGACAAATC 660
CTTGGTATCA TTGGGGGGAC TGGTTCTGGT AAATCAAGCT TGGTGCAACT CTTACTTGGA 720
CTTTATCCAG TAGACAAGGG GAACATTGAC CTTTATCAAA ATGGACGTAG TCCTCTTAAT 780
TTGGAGCAGT GGCGGTCTTG GATTGCCTAT GTACCTCAAA AGGTCAAACT CTTTAAGGGA 840
ACTATTCGTT CCAACTTGAC TTTAGGTTTA AATCAAGAAG TATCTGACCA GAAACTCTGG 900
CAGGCCTTGG AGATTGCGCA AGCTAAGGAT TTTGTCAGTG AAAAGGAAGG ACTTTTGGAT 960
GCCCTAATTG AAGCAGGGGG GCGAAATTTC TCAGGTGGAC AAAAACAAAG GTTGTCTATC 1020
GCCCGAGCAG TCTTGCGCCA AGCTCCGTTT ATCATCCTAG ATGATGCAAC CTCGGCACTG 1080
GATACCATTA CAGAGTCCAA GCTCTTGAAA GCTATTAGAG AAAATTTTCC AAACACGAGC 1140
TTAATTTTGA TCTCTCAACG AACCTCAACT TTACAGATGG CGGACCAGAT TCTCCTCTTG 1200
GAAAAAGGTG AGTTGCTAGC TGTTGGCAAG CACGATGACT TGATGAAATC CAGCCAAGTC 1260
TATCGTGAAA TCAATGCATC CCAACATGGA AAGGAGGACT AGAATGAAAC GACAAACTGT 1320
AAACCAGACG CTCAAACGTT TAGCCGTAGA TTTAGCAAAC CATCCCTTCC TCCTTTTCCT 1380
AGCCTTTCTA GGAACTATTG CCCAAGTTGG CTTATCAATT TACCTACCTA TTCTGATTGG 1440
GCAGGTCATT GACCAAGTCC TAGTGGCTGG TTCATCACCA GT 1482

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 895 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

CTCTGAAGGA TCCAGAAAGT CGCTTATATA TTTTATTAAA AGACGGTCAG GTTATTGGGA 60

CTTGTACGGT TGATTTATCG ACTAATACGA ATTACTTCTA CGGTTTAGCA ATATTGGAAC 120 

CTGAACGTGG AAAAGGCTAT GGAAGCTACT TAGCAAAATC CCTCGTCAAC CAACTAATTG 180 

AGCAAAATGA CAAGGAATTT CAGATTGCAG TGGAAGATAG CAATGTAGGT GCCAAACGTT 240

TGTATGAAAA AATTGGCTTT GTCAAACAGA CTCAGGTGGT TTATCTGAAT GAGAAAGGAG 300 

CAAGGGATTC CGAAGTGTAG AGATATTCGG ACTGAAATTT ACTTGAACTT TTAGTGATGA 360 

AACTAATTGT TCTTGGATTT CAGCTTTCCT GATTATGATT TATGATTAAA ATCTATGACA 420 

CCATGTCTCG TGATTTGCGA GAATTTGTCC CGATTGAGGA CGGCAAGATC AAGATGTATG 480 

TTTGTGGGCC AACGGTGTAC AACTATATCC ACGTGGGAAA CGCCCGTTCG ACGGTAGCTT 540 

TTTTGGATAC GAATTTCGTC GCTATTTTGA GTACCGTGGG TATAAGGTTG CCTATATTTC 600 

CAATTTTACA GATGTGGATG ATAAGATTAT CAACCGTCCC AGGGAAGAAG GCATCACGCC 660 

TCAGGAGGTT GCGGATAAGT ACATCGCTGC CTTTCGTGAG GATGTGACGG CCTTGGGCGT 720 

GAAACCTGCG ACTCGCCATC CGCGTGTAGT GGAGTTTATG GCAGACATCA TCCGTTTTGT 780 

GGAAGACTTG ATCGAGAAAG GCTTTGCCTA TGAGAGTCAA GGGGATGTCT ATTTCCGTGT 840 

AGAAAAATCC CACAACTATG CTAAATTGGC TAATAAAACC TTGGAAGATT TGGAG 895 

(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 709 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

CTGTTTGCCC TTTCTTCTAA TAAAGAATTG GCAGAACGTG TGGCGCAGGA GATTGGGATA 60
GAGTTGGGGA AATCAAGTGT TCGCCAATTT TCAGATGGAG AGATTCAGGT CAACATTGAA 120
GAATCAATCC GTGGGAAACA CGTCTTTATC CTACAATCAA CTAGTTCGCC TGTAAATGAC 180
AATCTGCTTG AAATTTTGAT TATGGTAGAT GCTTTGAAGC GTGCGAGTGC AGAATCTGTC 240
AATGTTGTCA TGCCTTACTA TGGGTATGCA CGTCAGGATA GAAAGGCGAG AGCGCGTGAG 300
CCAATCACTT CAAAACTTGT CGCAAATATG CTTGAAGTAG CTGGAGTGGA TCGTTTATTG 360
ACCATCGACT TGCATGCTGC GCAAATTCAA GGATTCTTTG ATATTCCTGT GGATCATTTG 420
ATGGGTGCTC CTCTGATTGC AGATTATTTT GAGCGTCCTG GTATGGTTGG TTCTGACTAT 480
GTGGTTGTCA GCCCGGACCA TGGAGGGGTG ACTCGTGCTC GTAAGTTGGC AGAATTTTTG 540
AAAACATCTA TCGCTATTAT TGAGAAACGT CGTAGCGTTG ATAAGATGAA TACTAGTGAA 600
GTTATGAACA CCATCGGTAA GGTTGAAGGC AACCACTTGT AGCTTCGATT GATGATATGT 660
ATTGATACCG CTGGAACGAT TTGTCATGCG GCAGATGCTC TTGCGGAAG 709



(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 816 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



CTAGAAGAAC TCCGTCACAA GCTTGTGACT TGTAAGCAAG AACAGAGCAA GATTGAGGAT 60
TCCTTATGAT TTCATTCCTT CTTCTATTGG TCTTGGTTTG GGGATTTTAT ATCGGCTATC 120
GGAGAGGCCT GCTCTTACAG GTTTATTACC TGATTTCAGC CATGGCATCG GCTTTTATGG 180
CTGGCCAGTT TTATAAGGGG CTTGGAGAGC AATTCCATTT ATTGCTCCCT TATGCAAATT 240
CGCAGGAAGG TCAGGGGACT TTCTTTTTCC CATCGGATCA ACTCTTTCAG CTGGATAAGG 300
TCTTTTATGC AGGTATCGGC TACTTGCTTG TATTTGGGAT TGTCTATAGC ATTGGTCGTT 360
TGCTTGGTCT TCTCTTACAC TTGATTCCTA GCAAAAAACT GGGTGGTAAG TTGTTCCAAG 420
TTTCAGCAGG TATCTTGTCC ATGTTGGTGA CCTTATTTGT CTTGCAAATG GCCTTGACAA 480
TCTTGGCGAC CATCCCCATG GCAGTTATAC AAAATCCTCT TGAAAAGAGT ATCGTCGCAA 540
AACACATCAT CCAGAGCATA CCGATAACAA CCAGTTGGCT CAAACAAATC TGGGTGACAA 600
ATTTAATCGG ATAAAAAGGG CAGGAGTTTT CCTAGCCCTT TGTTTACAGA TTTGACTCGA 660
ATCTATCAGA ATGTAAAAAG CTACCACACC TAGACATTCA AAGACAAGGA AATAAAGATG 720
AATAAGAAAA TATTAGAAAC ATTAGAGTTC GATAAGGTCA AGGCCTTGTT TGAGCCTCAT 780
TTGTTGACCG AGCAGGGCTT GGAGCAATTG AGACAG 816



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1001 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

CTTGAAGCAG CTGAAAGCAT GGGGTCTTCG GGATTTAAAA CCTATTCGTA ACGGGTTGTT 60
TTACCTTTCC TAGTTCCAAC CTTACTAGCA GCTCCTTGCT TGTATTTATG AGAGCATTCT 120
CAGACTTTGG AACGCCTATG TTGATTGGCG AAGGATATCG GACTTTCCCT GTCCTGATTT 180
ATACCCAATT TATTAGCGAG GTTGGAGGAA ATTCTGCTTT TGCAATTATG GCGATTATCA 240
TTGCCTTGGC AATTTTCCTT ATCCAAAAAC ACATTGCAAA CCGCTACAGT TTCAGCATGA 300
ATCTGCTCCA TCCAATTGAG CCTAAAAAAA CTACAAAAGG AAAAATGGCT GCCATTTATG 360
CAACAGTCTA CGGAATTATC TTTATCTCTG TTTTACCTCA AATCTACTTA ATTTATACCT 420
CTTTCCTAAA AACATCAGGT ATGGTATTTG TTAAAGGTTA TTCTCCAAAC AGTTACAAGG 480
TAGCTTTCAA TCGTATGGGA TCTGCTATTT TCAATACCAT TCGTATCCCT TTGATTGCCT 540
TAGTTCTAGT TGTTCCTATT TACGACATTT ATCTCCTACC TAGCCGTTAG AAAACGGAAT 600
TTGTTTACAA ACTTAATTGA CAGCCTCAGT ATGGTACCTT ATATTGTACC AGGAACCGTT 660
CTAGGGATTG CCTTCATTTC TTCCTTCAAT ACTGGTCTAT TTGGAAGTGG ATTTCTTATG 720
ATTACAGGGA CTGCTTTCAT CTTGATTATG TCTCTATCTG TCAGAAGATT ACCGTATACT 780
ATTCGCTCAT CTGTTGCTAG CTTACAACAA ATAGCACCAA GTATTGAAGA AGCTGCTGGA 840
AAGCTTAGGA AGTAGTCGTC TCAATACCTT TGCTAAGATT ACAACTCCAA TGATGCTATC 900
TGGTATCATT TCTGGAGCCA TCTTATCTTG GGTCACAATG ATTTCAAAAC TCTCTACTTC 960
TATCCTCCTC TACAATGTCA AAACAAGAAC AATGACTGTA G 1001

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1064 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



CTGGTTTGCA ATCTATTCCA ACTGAGATTA CAGAGGCAGC AAGGATTGAT GGTGCGACTA 60 

GCAAGCAAGT TTTCTGGAAC ATTGAATTGC CTTACTTGCT ACCAAGTGTC TCTATGGTCT 120 

TTATCCTAGC CCTAAAAGGT GGGCTGACTG CCTTTGACCA AGTCTTTGCC ATGACCGGTG 180 

GTGGTCCAAA CAATGCCACA ACCTCACTTG GGCTCTTGGT TTATAACTAT GCCTTTAAAA 240 

ACAACCAATT CGGTTATGCC AATGCCATTG CCGTAATCTT GTTCCTCTTA ATTGTAGTGA 300 

TTTCGATCAT CCAATTGAGA GTATCTAAGA AATTTGAAAT TTAAGAGGAG AAGCATGATG 360 

AAACAAGATG AAAGAAAAGC CCTGATTGGC AAATACATTC TATTGATTCT AGGATCGGTT 420 

CTGATTTTAG TGCCGCTCCT TGCTACCCTC TTTAGTTCCT TTAAACCCAC TAAGGATATT 480 

GTAGATAATT TCTTTGGCTT TCCAACCAAC TTCACATGGG ACAACTTTAG CCGTCTCTTA 540 

GCTGATGGGA TTGGAGGCTA TTATTGGACT CTGTCGTCAT CACTGTCTTG TCTTTACTTG 600 

CAGTAATGAT CTTTATCCCT ATGGCAGCCT ACTCCATCGC TCGCAATATG AGTAAAAGAA 660 

AAGCCTTTAC CATTCATGTA TACCCTCTTA ATCCTCGGAA TCTTCGTACC TTTCCAAGTC 720 

ATCATGATTC CGATTACGGT TATGATGAGT AAACTCGGTT TGGCTAATAC CTTTGGTTTG 780 

ATCTTGCTCT ACTTGACCTA TGCGATTCCA CAGACCCTCT TTCTCTATGT TGGATATATC 840 

AAAATCTCGA TTCCAGAAAG TCTGGATGAA GCAGCAGAGA TCGATGGGGC TAATCAATTT 900

ACAACCTATT TCCGCATCAT NTTCCCAATG ATGAAACCGA TGCATGCGAC AACCATGATC 960

ATCAATGCCC TTTGGTTCTG GAATGACTTC ATGTTGCCAC TCCTTGTCTT GAACCGGGAT 1020

TCCAAAATGT GGACTCTGCC TTTGTTCCAA TACAACTACG CAGG 1064



(2) INFORMATION FOR SEQ ID NO: 31: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 586 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:



CTGTAAAAGA AGTTAATCAA GTTGATAAAG CACTTTTAAA ACAAATTGGT GCAGTTGATG 60
TCTTAGAAGT GAAGGGTGGC ATTCAAGCAA TCTATGGAGC AAAAGCAATC TTATATAAAA 120
ATAGTATTAA TGAAATTTTA GGTGTAGATG ATTAAGTACT TACTGACTTA ATAAAAAACA 180
GAGGAGAGTG ATGGATGAGT AGGATGAAAT GAAATCGCAT ACAAGAAATA AAGAACTCAT 240
TATCCAAGTT GGATACGCTT ATTACATAGG AGAATACAAA TGAAATTTAG AAAATTAGCT 300
TGTACAGTAC TTGCGGGTGC TGCGGTTCTT GGTCTTGCTG CTTGTGGCAA TTCTGGCGGA 360
AGTAAAGATG CTGCCAAATC AGGTGGTGAC GGTGCCAAAA CAGAAATCAC TTGGTGGGCA 420
TTCCCAGTAT TTACCCAAGA AAAAACTGGT GACGGTGTTG GAACTTATGA AAAATCAATC 480
ATCCAAGCGT TTTGAAAAAG CAACCCAGAT ATAAAAGTGA AATTGGAAAC CATCCACTTC 540
CAGTCCAGTC CTGAAAAATC ACAACAGCCA TCCGAAGCAG GAACAG 586



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



TGGTAGTTCT TCCTAGTGCA GGGGCTGTAG ACCCAGTTGC GACCCTAGCG CTGGACTAGT 60 

CGAGAGGGTG TTGTTGAAAA TGGATGGNTA TCGCTATGTT GGTTATCTAT CAGGTGACAT 120 

CCTCAAAACG CTTGGCTTGG ACACTGTTTT AGAAGAAACC TCAGCAAAAC CTGGAGAGGT 180 

GACTGTAGTC GAAGTTGAGA CTCCTCAATC AACAACAAAT CAGGAGCAAG CTAGGACAGA 240 

AAACCAAGTA GTAGAGACAG AGGAAGCTCC AAAAGAAGAA GCACCTAAAA CAGAAGAAAG 300 

TCCAAAGGAA GAACCAAAAT CGGAGGTAAA ACCTACTGAC GACACCCTTC CTAAAGTAGA 360 

AGAGGGGAAA GAAGATTCAG CAGAACCATC TCCAGTTGAA GAAGTAGGTG GAGAAGTTGA 420 

GTCAAAACCA GAGGAAAAAG TAGCAGTTAA GCCAGAAAGT CAACCATCAG ACAAACCAGC 480 

TGAGGAATCA AAAGTTGAAC CACCAGTAGA ACAAGCAAAA GTCCCAGAAC AACCCGTGCA 540 

ACCTACACAA GCTGAGCAAC CAAGTACACC AAAAGAATCA TCACAACAAG AAAATCCTAA 600 

AGAAGATAGG GGAGCGGAAG AGACACCGAA ACAAGAAGAT GAACAGCCAG CAGAAGCCCC 660 


AAGAAATCAA GGTTGAAGAA CCAGTAGAAT CAAAAGAGGA GACTGTTAAT CAACCTGTTG 720 

AACAACCAAA AGTGGAAACG CCTGCTGTAG AAAAACAAAC GGAACCAACA GAGGAACCAA 780 

AAGTTGAAGT AACAAGTATT CCCCAAACTA CTCGCTATGA GGAAGACCTT ACTAAGGAAC 840 

ACGGAACGCG TGAAGTTGTT AAGGAAGGTA AGAATGGCAG TAGAACAGTT ACTACTCCAT 900 

ATATCTTGAA TGCGACAGAT GGTACGACTA CAGAAGGCAC TTCGACAACT GATGAAGCTG 960 

AGATGGAGAA AGAGGTTGTT CGTGTTGGCA CGAAACCCAA AGAAAAATTA GCTCCAGTCT 1020 

TAAGTTTGAC AAGTGTTACA GATAATGCAA TGTTGCGTAG TGCGAGACTT ACTTATCATT 1080 

TGGAAAATAC AGATAGTGTT GATGTGAAAA AAATTCATGC TGAAATTAAA AATGGCGATA 1140 

AGGTTGTCAA AACTATTGAC TTATCTAAAG AGAGATTATC AGATGCTGTT GACGGTCTTG 1200 

AACTTTATAA AGATTATAAG ATTGTGACGA GTATGACCTA TGATAGAGGT AATGGTGAAG 1260 

AAACCTCTAC GTTGGAAGAA ACTCCACTAC GATTAGACCT CAAGAAGGTT GAATTGAAAA 1320 

ACATCGGCTC TACTAATCTC GTCAAAGTAA ATGAGGATGG TACTGAGGTG GCAAGTGACT 1380 

TCTTAACAAG TAAACCTGTG GATGTGCAGA ATTACTACCT CAAAGTAACT TCCCGTGATA 1440 

ATAAAGTTGT TTCCCCTCCC AGTTGAAAAA ATTGAAGAGG TGACTGAGGA AGGTCCACCA 1500 

CTTTACAAAG TCCCTGCTAA GGCCCTAATT TGAT 1534 

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

CGGATTTGTT TATCTCAAGA AGAGTCGCGA TATTCTCCGT GAAAGTTCAG AATTGATTAN 60 
CCAAACGGTA GAAGATTATC TTCAAGGAGA TGACTTTGAC TGGGCAGATC TTAAAGGGAA 120 
GGTTCGANAT AATTTGACCA AGTATCTCTT TGACCAAACC AAGCGTCGTC CANCTATTTT 180 
ACCAGTAGTC ATGGAAGCAA AATAATCGTT GAAATAAACA GAGAGAAAGT CGAGTTTCGG 240 
CTTTTTCTTA TAGAAAAATA GAAGGAGAAA ATCATGGCAG TAATGAAAAT CGAGTATTAC 300 
TCACAAGTAT TGGATATGGA GTGGGGGGTG AATGTCCTCT ACCCTGATGC CAATCGAGTG 360 
GAAGAACCAG AGTGTGAAGA TATTCCCGTC TTGTACCTTT TGCACGGGAT GTCTGGAAAT 420 
CATAATAGTT GGCTTAAGCG GACCAATGTA GAACGCTTGC TTCGAGGAAC TAATCTCATC 480 
GTTGTTATGC CCAATACCAG CAATGGTTGG TACACCGACA CCCAGTATGG TTTTGACTAC 540 
TACACGGCTC TAGCAGAGGA ATTGCCACAG GTTCCTGAAA CGCTTCTTCC CTAATATGAC 600 
TAGCAAGCGT GAAAAGACCT TTATCGCTGG TCTTTCTATG GGAGGCTACG GCTGCTTCAA 660 
ACTGGCTCTT ACGACAAATC GTTTTTCTCA TGCAG 695 



(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1096 base pairs 




WO 97/37026 PCT/US97/05306 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CTCATCCAAG AATAGCATAA AAGAACTCTT CTGTTTTTTC TTGAGGAACG AGAAAGTCAG 60 

CAATCTCCAT TCCACGATCC ATCAAATCCT CTAAAGACAT CGTGATTTTT AAAGTAGTAT 120 

CACTAATTTG TTTCATTTTC ATTGCTAGTA ACCTCATACT TTCAGTTCTA TCTATTATAC 180 

TAGATTTTTA CGATTTTATC AAAAGAAGGC TCCTCTATAC GGATAGATTT TCCCTAGGGT 240 

CTTTCTATAG GAGACTCCAA GAGAAAATTT CTGCAGACAG ATAGAAAAAG CCTTCAAAAT 300 

CGGCTCTTAG CCGATTTTGA AGACCTTATA CATCAGAATA CTTATAATTT AAAGGTTGCT 360 

ACACCGAGGA TAGAACGATT TAAGTTTCTG AGAATTTGAA GACTTTGCTC AAATTTCTTA 420 

TAACGAATTC ACTCCGTACT CTTCAACAAG AAGGACTGTA TCTCTTTCCA AAAGAGATGA 480 

TACATCCTGC AAATCTACAA ATGCATTCCT TTTAAAGCTT CTTGACTCTG TTTCAATTTA 540 

TCTAGGATAG CTTTATTTGA GCTAACGATG GTCAATTCCT GTCCAGTATT TTTGTATGAC 600 

AAAACATCTG CTAGGTTAGC AATTGTTGTA ATCTCTGTTA CAAAATCAAT TTGATACTGA 660 

GAAAAATCAC CTACTCTATT GATTGTTGGA TTAAAGAGAT AAACTAACAC ATTTCCCATC 720 

ACAACCAAAA TCACACAAAC CACTCCAATA ACAACTAAAC GAAGAATCAG ATTTTTCACA 780 

TTTAAGCCAA GCGCTGTTTC ACCATTTGCG TTCAATTCTT TAGAGTTGAT GGTTTCCAGT 840 

TTTTCAATTT TCACATTTGC ATAAGCATGT TTAAATTTCT CAATCAACCC ATCAATTTTT 900 

TTCTCTAACA AGTCATTGGC ATCTTTACTT GATGTCAAAA TTTTCACACC AACCCCTGCA 960 

TCGTCAATCA TATAGTAGAC GGTCAATTTT TTCCACCAAT AGTCATTCGT TGAATTTTTC 1020 

AAGGTTGTTT CCGTCGTGTC TAATTCACTG GCAATTTTTT TCAACTCACT GGGTTCTACA 1080 

TCATTGAAAA GATAAG 1096 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1037 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TCCCCATCCC AGTGGGCAGC AGCATTAGTC TGTACTTCTG CGTCGTTGCG TTTTATTAGA 60 

AGACTGATTT TGCTTTGAAC TACTTGAAGC TGTCGTTGTA TCCTGCGAAG TTTTTGCTAA 120 

CTTCAGTTTC ATTTGCAGCG ACATGACTAG TTGCCAAGCC TAGTAAACAG ATACTTGCTA 180 

ATCCAATTTT TGTTTTCAAT CTTTCCTCTC CTATAAAAAA TGTAACAGAC ATCTGAATGC 240 

TGTTCCACCT AGCTTTTGCT ACTTACTGAT TATTTTACAA AGTCAAGCAA [illegible] 300 

CTTTCAGCTT CAAGTGACGC ACCACCTACA AGGGCACCGT CAACGTCTGG [illegible] 360 

TATGAAGCAA CATTTTCAGG TTTAACAGAA CCACCGTATT GAACACGAAC [illegible]GCG 420 

ACTTCTTGAC CAAAGTCAGC AGCTACAACG TCACGAACAA CTTTACACAT [illegible] 480 

TCGTCTTGTG AAGCTGATTT ACCAGTACCG ATAGCCCAGA TTGGCTCATA [illegible] 540 

GAGGCAGCAA CTTGTTCAGC AGTCAATCCA GCCAATGCAG CAGATACTTG [illegible]ACG 600 

AATTCAGCAG CTTTACCAGC TTCGTAAGTT TCAAGTGATT CACCACAACA [illegible]ATTGGA 660 

AGCATACCGT TCGCAAAGAT TGCTTTTGCT TTTTTGTTGA TATCTTCGTC AGTTTCATGG 720 

AAGTAGTCAC GGCGTTTCTG AGTGACCGAT AACAACGTAG TCAGTACCGA [illegible]TTTCAA 780 

AACTTGTGGG CTAGTTTCAC CAGTGAAAGC ACCTGCATTT TCTCCGTAGC AGTTTTGAGC 840 

AGCGACTTTT AAGTTTGAGC CTTNGGCAAC AGCAAGNACA GTTGTCAAAT CAAGAGCTGG 900 

AGCAGNGATA CCTGCTTCAA CAAGATCTGA TGAAGGAAGT TTTGATGCAA CTGCTTCAAC 960 

GAATGNTCCA GCTTNTTGGG GATTTTTGTT CATTTTCCAG TTACCAGCGA TAAATGGTTT 1020 

ACGTGACATT TCACATA 1037 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

CTTATCGTAC TAAGGATGGC AGTGTTCAAC TGTTCCGTCC TGATGAAAAT GCTAAACGCC 60 
TGCAACGTAC ATGTGACCGT CTCTTGATGC CAACAAGTTC CGAACAGACA TGTTTGTAGA 120 
AGCTTGTAAA GCAGTTGTCC GTGCGAATGA AGAATACGTA CCACCATACG GAATAGGTGG 180 
AACTTTATAT CTTCGCCCTC TTTTGATTGG TGTCGGAGAT ATTATCGGGG TAAAACCGGC 240 
AGAAGAGTAC ATTTTCACCA TCTTTGCTAT GCCAGTTGGA AATTACTTTA AAGGTGGTTT 300 
GGTCCCAACC AACTTCTTGA TTCAGGATGA GTACGACCGT GCAGCACCAA ATGGTACAGG 360 
TGCGGCTAAG GTTGGTGGAA ACTATGCTGC AAGTCTCTTA CCAGGAAAAA TGGCCAAGTC 420 
ACGCCATTTC TCAGATGTTA TCTATCTGGA CCCATCAACT CATACAAAGA TTGAAGAAGT 480 
CGGATCAGCT AATTTCTTTG GAATTACAGC TGATAATGAA TTTGTAACAC CATTGAGTCC 540 
ATCTATCTTG CCATCTATTA CCAAGTATTC CTTGCTTTAT TTGGCAGAAC ATCGCTTGGG 600 
ATTAACTCCT ATTGAGGGTG ATGTTCCAAT TGATAATCTT GACCGTTTTG TAAAGGCAGG 660 
TGCCTGTGGT ACAGCAGCGG TTATTTCTCC AATTGGAGGT ATTCAACATG GTGATGATTT 720 
CCATGTTATT CTATAGTGAA ACAGAAGTAG GTCCTGTGAC ACGTAAATTA TATAATGAAT 780 
TGACGGGTAT TCAGTTTGGC GATATTGAAG CGCCAGAAGG TTGGATTGTA AAAGTAGATT 840 
AAAATAAACC AAAGGAGATT TTTTATGAAA TAGAAAAAGT GGCTCTTAAC AGCAGGAGTG 900 
GTCCTGAGCA CGTCAGCTAT TTTAGTGGCT TGTGGAAAAA CTGATAAAGA AGCAGATGCA 960 
CCGACAACAT TTTCTTATGT CTATGCAGTA GATCCAGCAT CATTGGGCTA CAGTATAGCG 1020 
ACTCGAACAT CGAGGACAGA CGTTATTGGA AATGTTATTG ATGGTTTGAT GGAAAATGAT 1080 

AAATACGGCA ATGTTGCTCC TTCTCAAAAA GACTATGATT TGAACAGTAC AGGATGGGCT 1140 

CCAAGCTATC AAGATCCAGC GTCTTACTTG AATATTATGG ATCCAAAATC TGGTTCTGCC 1200 

ATGAAACACC TTGGCATTAC GAAAGGAAAA GATAAGGATG TTTGTAACTA AACCTGGTTT 1260 

GGATAAATAT AAGAAATTGT TAGAAGATGC TGTTTCTGAG ACCACTGACC TAGAGAAGAG 1320 

ATATGAAAAA TATGCCAAAG 1340 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



CTCTGACGGC GTCGCCACTT AAGAAGAGTA TCAAAAAGAA AAATAGAAAA TTAACTAACA 60 

AGGNAGAAAA ACACATGTCT AAAATTATCG GTATTGACTT AGGTACAACA AACTCAGCAG 120 

TTGCAGTTCT TGAAGGAACT GAAAGCAAAA TCATCGCAAA CCCAGAAGGA AACCGCACAA 180 

CTCCATCTGT AGTCTCATTC AAAAACGGAG AAATTATCGT TGGTGATGCT GCAAAACGTC 240 

AAGCAGTCAC AAACCCAGAT ACAGTTATCT CTATCAAATC TAAGATGGGA ACTTCTGAAA 300 

AAGTTTCTGC AAATGGAAAA GAATACACTC CACAAGAAAT CTCAGCTATG ATCCTTCAAT 360 

ACTTGAAAGG CTACGCTGAA GACTACCTTG GTGAGAAAGT AACCAAAGCT GTTATCACAG 420 

TTCCGGCTTA CTTCAACGAC GCTCAACGTC AAGCAACAAA AGACGCTGGT AAAATTGCTG 480 

GTCTTGAAGT ANGACGTATT GTTAACGAAC CAACTGCAGC AGCTCTTGCT TATGGTTTGG 540 

ACAAGACTGA CAAAGAAGAA AAAATCTTGG TATTTGACCT TGGTGGTGGT ACATTCGACG 600 

TCTCTATCCT TGAATTGGGT GACGGTGTCT TCGACGTATT GTCAACTGCA GGGGACAACA 660 

AACTTGGTGG TGACGACTTT GACCAAAAAA TCATTGACCA CTTGGTAGCA GAATTCAAGA 720 

AAGAAAACGG TATTCGACTT GTCTACTGAC AAGATGGCAA TGCAACCTTT GAAAGATGCG 780 

GCTGAAAAAG CCAAGAAAGA CCTTTCTGGT GTAACTTCAA CACAAATCAG 830 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 



GTGGTGGTTT GACTGCCCTT ATCGAAGCAG GTTTCGAANT CTAAGACANA AGCAGGTTAC 60 

GCTCCAGAAT TGGCTTAACT TTGAAGTTCT TCACGGAAAT GAAATTGATC GTTGACTTGA 120 

TCTACGAAGG TGGATTCAAG AAAATGCGTC AATCTATTTC AAACACTGCT GAATACGGTG 180 

ACTATGTATC AGGTCCACGT GTAATCACTG AACAAGTTAA AGAAAATATG AAGGCTGTCT 240 

TGGCAGACAT CCAAAATGGT AAATTTGCAA ATGACTTTGT AAATGACTAT AAAGCTGGAC 300 

GTCCAAAATT GACTGCTTAC CGTGAACAAG CAGCTAACCT TGAAATTGAA AAAGTTGGTG 360 

CAGAATTGCG TAAAGCAATG CCATTCGTTG GTAAAAACGA CGATGATGCA TTCAAAATCT 420 

ATAACTAATT AGAAATATAT AGCGCTGGAG ATGATTTTAT GAAAAAGATT ATGAGAAAAA 480 

TTGCATCGTT ATTATTGGTT CTAGTTGTAT AATGTAATTA CACCGTCGGT AATAGTGCTA 540 

GCAGACCAAA ATAAAGCAGA TTGGTCGTAT GATGAAAATA CTGTAATTAA CATTTATGAT 600 

GATGCTAATT TTGAAGATGG TAGGTTGCAT ATGACCTTTG AACAATTCTT CAAATTGGCA 660 
CAAATAG 667 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1397 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CGGTTCCAGA NACTGGTATG GCTAGGAGTC GCGACAAAGT GTATCTTATG ATGGTCGATT 60 

TGATACTCAA TACGCTCAAT ACTTACGGAA AGAATAAACG ATCATTGTCA ACCGGAATAG 120 

TATGATAATC GTTTCTTGGT GGAATAAGTT CAATCTTATC CTTCTTAGGC ACAATAATGG 180 

AAGAGCCCAG TGTTCGATAA ACACGATTAT TAAGGCTGGC AATTTCCGTT AATTGCAAAG 240 

CTTCAATGGT AGGGTGTAAA ACAGCACCAC CAAGAGACTA GTTATAGGCA GTACTACCAG 300 

TCGGTGTCGA AACTGTTAGC CCGTCTCCAC GAAAACGTTC AAAGGGAACA CCATTTATTA 360 

CAATATCTGC CACCATGGTT CGATCAGACC TGCGGATGCT GGCTTCGTTG AGTGCTCTGA 420 

AAATCTTAAC TTCACCATTT TCAAGAAAGA CCTTCACATT CAGAACAGGG TAAGAAACCC 480 

TTGCCCCAGT ATCTAGCTGC AAATTAGTCA CTAGCTTGTC CAACTCAAAA TCACGATAAT 540 

CTGTATAGAA GCCCAAATGT CCAGTATGAA GACCGATAAA GCGGACCTTG TCAAGCTGAT 600 

TTTCGTACTT ATGAAAGGCC GACAAGAGCA TACCATCCCC GCCAATGGAA ATGACAATAT 660 

CCGGATTGGT ATCATTGAGT TATAAACTGA TTTCTCTTCA AACGATCTCG CAATTCATAC 720 

AAAACCCTTT GACTCTGCGG TTTTCTATTG GCTATCAGAT CAATTCGTTT ACCTGTATTC 780 

TTCATCTGTA TCGTCACTGN NTCCAACACC GTCATTTAAT TTTCTACTCA AAGGATCAAA 840 

AAGTGCCTGG GCTTCTTGGA TATCATCACG AATTTCACCC ATTTCTTCAT CCAACTGATG 900 

GGCGATTCTA GCTGTAATTT CCAGTCGCTT CTTAATCTCA TCTGGGAAAT CCCCTTGGTA 960 
CTTGTAGTTG AGAGAATGTT CTATCGTTGC CCAGAAATTC ATGGCCAAAG TACGAATTTG 1020 

AATTTCTGCC AAAATAGTCT TAGCTCCATT GATGGTATCA ACCGTATATT CTACTACCAC 1080 

ATGATAGGAA CGATAGCCTG ATGCTTTTCT ATGAGTAATG TAATCTCGCT CCTGTATGAT 1140 

TCGCATATCC TGACGCTTGT GCAAAATATC CACTACTTCC TTGACGTCAT CTACAAACTG 1200 

AACCATCACA CGTAAGCCAG CAATATCCTG CAAATCGTGT TCCAAGGTCG CATAAGTAAT 1260 

GCCACGACGA GCCATTTTTT CTTTGATGCT CTCAATTGGC TTGACTCGAC CGGTCACAAA 1320 

CTCAATTGGA GAATGCTTAT TTTGCTTACG ATATTGCTTA CGAATACCAC GAAGTTTAAT 1380 

CTTTAACTCA CCAACAG 1397 



(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



CTCCCATGAC ACCTGCACGC TCCGTACGGG CCACGTGCTT TTTAATCCGT TCAACAACTT 60 

CATAACCCGC TTCAACATCC ACACCCGACT GGGCATATGC ATTTTTATTT GTCATTTGTT 120 

TTTTCCTTTT CTTTAATGGA GAATCTGTCG CCTACTTGTA AAAACTGGTC TTTTCTTCCA 180 

AACTTCTACG ATAGTCTTCT TCGTAGTCAT AGAGAGGCGT TGGGTAGTCA CCGTCAAAGT 240 

AAGCGACACA GAGACCACCG TTCGGCGCAT CTGTTTCGAT ACCAATCGAC TCAATCAAGC 300 

TATCAATTGA AAGATAAGTC AGACTGTCCG CACCAATGAT TTGGCGAGTT TCTTCGACCG 360 

TATGATTGGC TGCAATCAGC TCCTGACGGG TCTGGATATC AATCCCGTAG AAACATGGAT 420 

ACGCTAGTGC AGGACTTCCA ATGGCAACGT GAACCTCAGT CGCACCCGCT TCTTTCAAGA 480 

GCTGAACGAT ACGACGAGAG GTTGTTCCAC GTACAATGGA ATCATCCACC ATGACCACAC 540 

GTTTGCCTTT GACAACACCC GAAACAGCAG ACAGTTTCAT CCGCACTCCT TGCTCCCGCA 600 

ATTCTTGAGT CGGTTGGATA AAAGTTCGCT GGGTGTATTG GTTTTTGATC AGACCCATTT 660 

CATTTGGTAA GCCTGATTCT TCCGCAAATC CCATAGCCGC GCTTAGGGAA GAATTGGGCA 720 

CACCAACTAC AATATCTGCC TCATGCTTAA ATTCTCGCGC CAATTGCGCT CCCATTCTCT 780 

TACGTGCCGT ATGGACATTG ACCCGTGGAT ATTAGAATCA GGGC 824 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

[illegible] TTCACTCCTA 60 

[illegible] AAAACGTCCA 120 

[illegible]CTCC GCCTACCAAC 180 

[illegible]TAT CCCTCGCCGA 240 

CCACCTGTTC CATCATATAA 300 

TTTCTACCAA ATAGAGAGCC 360 

TTGGAAACAG ACTGCCGGTA 420 

GTAAACTACT AAAATAGGCC 480 

TGCCTTCGTT ATCAGGTATG 540 

AGCCGGTCCT GCCGCCTGCT 600 

TTTCTGCCTC TAAAACTTCT 660 

CTTCAGCCCC ATTAATCTCA 720 

TTTCACCTCC AAAAGGATTA 780 

AAATGCGAGA AAGTACATTA 840 

TTGAGCCTTT ATAACGCCAT 900 

AAAATTAGCA GCCATTGGGG 960 

ACCAGCCACA ATTGGATTAG 1020 



(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CTCGTAAAAA CACTAAAGGA GGTAAATAAC CAATGTTAGT ACCTAAACGT GTTAAACACC 60 
GTCGTGAATT CCGTGGAAAA ATGCGCGGTG AAGCAAAAGG TGGAAAAGAA GTAGCATTCG 120 
GTGAATACGG TCTTCAAGCT ACAACTAGCC ACTGGATCAC TAACCGCCAA ATCGAAGCTG 180 
CTCGTATCGC CATGACTCGT TACATGAAAC GTGGTGGTAA AGTTTGGATT AAAATCTTCC 240 
CACACAAATC ATACACTGCT AAAGCTATCG GTGTGCGTAT GGGATCTGGT AAAGGGGCAC 300 
CTGAAGGTTG GGTAGCACCA GTTAAACGTG GTAAAGTGAT GTTCGAAATC GCTGGTCTAT 360 
CTGAAGAGAT TGCACGTGAA GCGCTTCGAC TTGCTAGCCA CAAATTGCCA GTTTAAATGT 420 
AAATTCGTAA AACGTGAAGC AGAATAAGGA GAAGGCATGA AACTTAATGA AGTAAAAGAA 480 
TTTGTTAAAG AACTTCGTGG TCTTTCTCAA GAAGAACTCG CGAAGCGCGA AAACGAATTG 540 
AAAAAAGAAT TGTTTGAACT TCGTTTCCAA CCTGCTACTG GTCCAATTGG AAACAAACAG 600 

(2) INFORMATION FOR SEQ ID NO: 43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



CTTCAGCGAC ATATCTATCC ACAACTTCTC TCGATTCATG TTCCTCTGAA AATGCCTGAA 60 

ATTTTAATTG ACTAATTTGA TCCTGATACG AACTATCTGC TAACAAAACT TCAAGATGGG 120 

AAACATTTGC TAACGGATAA GGTCTTCTAT CCTTACCTAA CCAAGTTTCT GTCTCTTCAT 180 

CCTCTATTAG TCCCCAGTTA CTGGCAAAGT CAGGATGATT CTCTAAAAAA ATACGTTCTG 240 

TCTGAAAAGT GACTGACCGA ATGGGGAAAG AAGCTGTTCC TCTCTCAAAA CTAGTAAACA 300 

ATGCACGCGC AATCCCCTGA CGGCGATGAC CTGGATGAAC CAGTATCGTC ACTTCTACAT 360 

CTTGGTCATC TGCATAGACA GTTAATAAAC CAACAAGTTC GCCTTTTTCA TAATAAAGGA 420 

AAAAGGCGGG CATGTTTGGA TCAAAATTAA GCATGTTAGA GAGATAGGGA TCGCGATAGG 480 

TACCGTCATA GTTTTGGCAA CAGTTAATTA CTTTTTTCGC CTCAGATAGC TCTTCTTGGC 540 

TTAACTTGTT TCTTGCTTGA ATCATATAGG TATCCTCTAC AAACCAGACG ATCTGTGACT 600 

GGCATCTTTA GCCTGCTCGA GTTTATTGAC ATAATACTCC CGTTTTTCTT CGACTTCGTG 660 

AATAACAGGC TCATCTTTCT TACCATGAAG ACGGACAATC TTGGCCGGAA TACCGACAAC 720 

CGTCACATCA CTAGGTACAT CTGCTACGAC AACTGCTGCA GCACCGACCT TGGCATTTTC 780 

ACCAATTTCC ACAGGCCCGA TAACTTGGGC ATGGGCTGAT ATGAGGGCTC CCTTTCGTAC 840 

AGTCGGATGG CGTTTGCCAC AGTCTTTCCC TGTTCCCCCG AGAGTCACTC CGTGATAGAG 900 

AAGAACGCCT TTTTCAACAA TCGCTGTCTC TCCAATCACC AGACCAGAAC CATGGTCAAT 960 

AAAGACACCT GAATCAATCT GGGCTCCTGG ATGAATCTCA ATCTGAGTCC AAAAGCGCCA 1020 

AAACTGACTG TGCATACGAG CTAAGAGTTT GAAGTCGTAC TTCCAGAGAA AATGCGAGAG 1080 

ACGGTGGGCC GCCAAGGCCT TGACACCTGG ATAAGTCAGC AAAACCTCCA AAGTGGTGCG 1140 

GGCCGCTGGA TCATTTTCTT TTACGATATC AATGGTTTCG CGCCACCACC CCATACATTT 1200 

CTCCTTTTCT TACTCTGAAT CTT 1223 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



GGTGAGTGCC TTTATTAGCG GAGCTTCTCA AGTGATTGTT GCCTTGATTA TCTTCCTTTC 60 

ATGCTCTTTT ATCTCTTGCG TGATGGGAAA GGCTTGCGTA ACTATTTGAC CCAATTCATT 120 

CCAAGCAAAT TGAAGGAACC TGTTGGACAA GTTCTATCAG ATGTGAATCA ACAGTTGTCC 180 

AACTATGTTC GAGGGCAAGT GACAGTGGCT ATTATTGTAG CAGTAATGTT TATCATCTTC 240 

TTCAAGATTA TTGGTCTACG CTATGCGGTT ACGCTGGGGG TTACTGCTGG TATTTTAAAT 300 

CTGGTCCCTT ATCTTGGTAG CTTTCTAGCC ATGCTTCCTG CCCTAGTATT GGGTTTGATT 360 

GCTGGTCCAG TCATGCTTTT GAAAGTAGTG ATTGTCTTTA TTGTAGAACA AACTATTGAA 420 

GGCCGTTTTG TCTCTCCATT GATTTTGGGA AGTCAATTAA ACATCCACCC TATTAATGTT 480 

CTCTTTGTTT TGTTAACTTC AGGATCTATG TTTGGTATCT GGGGAGTTTT ACTTGGTATT 540 

CCGGTTTATG CCTCTGCTAA GGTTGTCATT TCAGCCATTT TCGAATGGTA TAAGGTAGTC 600 

AGTGGTCTAT ATGAATTAGA GGGTGAGGAA GTCAAGAGTG AACAATAGTC AACAGATGTT 660 

ACAGGCTTTG GAGGAGCAAG ATTTAACTAA GGCTGAGCAT TATTTCGCCA AAGCTTTAGA 720 

AAATGATTCA AGTGATCTTC TGTATGAGTT GGCAACTTAT CTTGAAGGGA TTGGTTTCTA 780 

TCCTCAGGCC AAGGAAATTT ACCTGAAAAT TGTAGAAGAT TTTCCAGAGG TTCATCTTAA 840 

TCTAG 845 



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



CTATGGATTA ATCTTCTTCT ATCTTATCCT GTTCCTAAAT ATCTTGGTAG AAAAATGAAG 60 
CAGTTGTTAC GAGTGTCTGT ATTGGAAAAA AATTTTGTCT ATACAATTCT TTAAATGATA 120 
TGAAGACTAG GAATGCCAAA ACACTGACAT ATTTCGTTTA AAGCCTATCC AGTCTCTTCA 180 
ATTGTATTTG TAGAAATGCT ATCAGTAATA TATTCGCGAA CACTATGATT TACAACAGTG 240 
TGGTCGTACA TCTTTGTAAC TTTATTATAC AACAACCATT TACAGGTGTA AAGTTGAGCA 300 
GTGTCTTTAG TAAATTTAGG GACTTGTTTT GTCATAGTTT TGATACCTGT ATTCGTTGTT 360 
TCATGAGTAT TTGACGTTTC TTTTCATGTA GGCATTTCAA CAATTAGAAT GATATTTTTC 420 
CGAGGATGAA CTTATATATT ATGACTTTTT AGAGTTATGA TGCCTCTTGA TTTGACTATC 480 
GCTATCAATT TCATCCACTT TTACAGTTTC ATGCAACATA GCGAGAAGGC GGTTTCCTAT 540 
ATGGAAGTCG TCTTGTTTTC ATATTGTCTA AATACGAGAT ATTTACTACA GGTTGTAGAT 600 
ATTGCAAGCT CAAATGATAT TATTTTAGAG GAGGAGACAA GTGAAATAAT GCCTGGTGAG 660 
TTAGATGAAA TAAAAAGTAA GGAGTTGAAA GTACTGGTTC NTTGTGCAGG GTCTGGAACA 720 
AGTGCGCAAT TAGCCAATGC AATTAACGAG GGCGCTCNAT TAGCAGAAGT TAGAGTGATT 780 
GCGAATTCAG GAGCGCACGG AG 802 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 928 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



CTGGGACTCT CTTCGTAGAA GTCATGCAAG AATATTTTGA TCAAAAGAGG AAATCATGAA 60 

AAAAAGAGCA ATAGTGGCAG TCATTGTACT GCTTTTAATT GGGCTGGATC AGTTGGTCAA 120 

ATCCTATATC GTCCAGCAGA TTCCACTGGG TGAAGTGCGC TCCTGGATTC CCAATTTCGT 180 

TAGCTTGACC TACCTGCAAA ATCGAGGTGC AGCCTTTTCT ATCTTACAAG ATCAGCAGCT 240 

GTTATTCGCT GTCATTACTC TGGTTGTCGT GATAGGTGCC ATTTGGTATT TACATAAACA 300 

CATGGAGGAC TCATTCTGGA TGGTCTTGGG TTTGACTCTA ATAATCGCGG GTGGTCTTGG 360 

AAACTTTATT GACAGGGTCA GTCAGGGCTT TGTTGTGGAT ATGTTCCATC TTGACTTTAT 420 

CAACTTTGCA ATTTTCAATG TGGCAGATAA CTATCTGACG GTTGGAGTGA TTATTTTATT 480 

GATTGCAATG CTAAAAGAGG AAATAAATGG AAATTAAAAT TGAAACTGGT GGTCTGCGTT 540 

TGGATAAGGC TTTGTCAGAT TTGTCAGAAT TATCACGTAG TCTCGCGAAT GAACAAATTA 600 

AATCAGGCCA GGTCTTGGTC AATGGTCAAG TCAAGAAAGC TAAATACACA GTCCAAGAGG 660 

GTGATGTCGT CACTTACCAT GTGCCANAAC CAGAGGTATT AGAGTATGTG GCTGAGGATC 720 

TTCCGCTAGA AATAGTCTAC CAAGATGAAG ATGTGGCTGT CCTTAACAAA CCTCAGGGAA 780 

TGGTTGTGCA CCCGAGTGCT GGTCATACCA GTGGAACCCT AAGTAAATGC CCCCCATGTA 840 

TCATATTAAG GACTTGTCGG GTATCAATGG GGTTCTGCCT CCANGGATTG TTCACCCGTA 900 

TTGATAAGGA TACGTCACGT CTTCTCAT 928 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CTCTTTTACA GAGGTGTTTC CTATTTTGTG CTTCCATTTT TGGGGCAAGG AATTGTAGAT 60 
GGGGATGGCA ATCCTATCTT TTTATTGATT ATGATATTCG TTTGCTTCAT AGTTTTAGTC 120 
TTTTTGAAAT GGTTAGACTA TGATTTCACT AGATTGAGAA GGGAGTTTCT AGATACAGGT 180 

TTTCAAAAGT CTCTTACTAA GATTAACTGG GCAATGGGGG CTTATTATCT AGTGATGCAA 240 

AGTCTATCTT ACCTTGAATA TGAACAAGGT ATTCAATCAA CGACTGTTCG CCATCTCATC 300 

CTAGTGTTTT ACCTACTCTT TTTTATGGGG GGTATCAAGA AATTGGATAC CTATTTGAAG 360 

GAAAAACTTC AGGAGGAACT GAACCAAGAG CAGACCTTGC GCTACAGAGA TATGGAACGC 420 

TATAGTCGGC ATATAGAGGA ACTTTACAAG GAAATTCGGA GTTTTCGCCA TGACTACACT 480 

AACCTCTTAA CCACTTACGT TTGGGCATTG AAGAGAAGGA TATGGAGCAG ATAAAAGAGA 540 

TGTACGACTC GGTCTTAAGG GATTCCAGTC AGAAATTGCA GGACAATAAA TATGACCTGG 600 

GCAGATTGGT GAATATTNGT GACCGTGCCC TCAAGAGTCT CCTAGCTGGA AAATTTATAA 660 

663 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CTAGATTAAT CTGATATTTC TTTTTGGCAA ACCAAATGCT TCCTATCAGA AAAGCTAGCA 60 

ACAGCAACAT GGTAATAATG ATATGAATGG TCATTTTCTT CTCCTATTCT GCCTTTTCAA 120 

TATCTTTTTT CATCTCGTCA ACATTGAACT TAGCAAACAA GTATTGACGG TCTTGGACTG 180 

GGAAACGTTG GTCCAACTGG TCAACTGCTC CCACCTCGAT AATGCCTTCC TTGATGACAA 240 

AATCCATGAC ATGTCCACCG AAGGTCAAAT CATCTGATAT GAAGTGCAGA TGGTAGCCTG 300 

CCACACTGAC CCCATGGAAA ATCTCAGGCG TCCAGAAACC AACAATAGTC CCCGCAACAT 360 

TGTCACGACT ATATTCCGGT TGATGGGTTG CGACATCAGC AAACTTGGTA TCGGGTGTTG 420 

ACTTAGGAAT CATACGCACA TGCATATGTT AAAAATTCCC CCCCGAATCT TGATAGAGCG 480 

GAAAAGATTT TCCCCATCAT AATAAGACTC AATTCGTTCT TTCAATTCCT TGTCTGTCAT 540 

CTCAAAGCGC TGGCGAAAAA TGACCTCTGC CTGATGCGGT ACCACTGCAG CGTAAGGAAT 600 

AAGGGCATCT GGTGACACTT CCACAATTTC TGGCTGGTCT CCTGACCCTT TGG 653 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



CTACGATATC TTTGGTCTTT TGTAAGATAT GAGGTCCACC CTTATGCGCC TCAGTTGGCA 60 

TTTCATGCGA TTCAAGAAGT TGCCCCTCTT GATCAACCAA ACCATACTTG ATGTTGGTTC 120 

CACCGATATC AATTGCAACG TAATATGTCA TAAATACCTC CTTTTAGATT AGAGGAAGCG 180 

CTCCTTGGTT TCACGAATCA AGGCAGCAGC CGCTTCTACA ACTGGACGAT CTTCTTCAGT 240 

CACTGGTGTC AATGGTGAAC GAACAGATCC AATATTCAAG CCTTCATTGA TTTTCAAGAC 300 

TTCTTTGATG ACACCGTACA TATTTCCATG AGCAGAAGTG AGTTTACCAA TGATTGCGTT 360 

GATAGCATAC TGCAATTCAC GCGCTGTTTC TAGGTCCTTA TCCGCAATCA ACTGATTGAG 420 

TTTCAAGAAG AGTTCTGGCA TAGCACCATA AGTACCACCG ATACCAGCCC TAGCCCCCAT 480 

GAGGCGTCCT CCTAGGAACT GCTCATCAGG ACCATTAAAG ACGATATGGT CTTCTCCACC 540 

AAGGCTGACA AAGGTTTGGA TATCTTGAAC TGGCATAGAA GAGTTCTTCA CACCGATAAC 600 

ACGAGGATTT TTCAACATTT CTGTGTAAAG GCTTGGAGTC AAAGCAACCC CTGCCAATTG 660 

AGGAATGTTG TAAATCACGT AGTCTGTGTT TGGAGCTGCA GAACTGATAT CGTTCCAGTA 720 

TTTGGCAACT GAGTTATTCT GGCAAGCGGA AATAAATTGG TGGAATCCGT TGCAATAGCA 780 

TCTACTCCCA AGCTTTCAGC ATGGCGAGCA AGTTCCATAC TATCTTTAGT ATTATTGCAA 840 

GCAACATGGG CAATAATGGT CAATTTACCT TTGGCTACCG CCATGACTTC TTCCAAAATC 900 

AACTTGCGAT CTTCAACGCT TTGGTAGATA CATTCACCAG AAGAACCATT GACATAAGAC 960 

CTTGAACACC TTTATCAATG AAGTATTGA 989 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



TCAAAGAGTA ACAAAGGCAC CAAATTCTCG ATAGGAACGA TTTAGCACGG TAAACTTCAT 60 

CCACTTGGGT TCACGGAACC AAACCAGCAA TAATTTCTTT GGGCACGGGT TAATAGCATT 120 

TTGGTCAACT AGGAGTAGAT AGAACACATT TCNTTCTTCG TCTATATCAA TCTTAACACC 180 

TGTTTCAGCG ATAATCTTGT CGATGGTTTC TCCACCCTTA CCGATGACAA TCTTAATCTT 240 

GTCCACATCA ATCTTGATCG TATCAATTTT CGGAGCAGTT GGAGCCAATT CTGGACGAAC 300 

TTCTGGAATG GTTGCTTCAA TGACATCAAG GATTTCAAAA CGCGCTTTCT TGGCTTGAGC 360 

AAGAGCCTCC GTCAAGATTT CTGCAGTAAT CCCTTGAATC TTGATATCCA TTTGAAGGGC 420 

TGTAATCCCA TCACGAGTAC CTGCAACCTT GAAGTCCATA TCTCCAAAGT GATCTTCCAA 480 

ACCTTGGATA TCTGTCAATA CTGTGTAGTT ATTTCCATCT GAGATAAGTC CCATAGCAAT 540 

ACCAGCTACT GGCGCCTTGA TTGGCACACC ACCAGCCATA AGGGCAAGAG TTCCCGCACA 600 


GATAGAAGCT TGAGATGAAG AACCGTTTGA TTCCAAAACT TCTGCTACTA GACGGATAGC 660 

CAAGGGCACC 720 

CAGAATATTG 780 

[illegible]AATGAT 840 

[illegible] 900 

AAAGGACGGA 960 

[illegible] 1020 

TCAAATTCTT 1080 

GCTTGAGTTG 1140 

CACTGTTGTA 1200 

CTTCTGCTTT 1260 

[illegible]ACAGCTTC 1320 

CACCAGACTC 1380 

ATTGCTCTGC 1440 

GTACCCCAGC 1500 

CAAACATAGC 1560 

CTTGGACTTC 1620 

G 1671 



(2) INFORMATION FOR SEQ ID NO:51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 731 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CTCAGTATCT TTGTAATCAA CATATCCAAT TTTGTTTGCT GCGATGTANT CAACTTTTTT 60 
ACGGCGTTTG AATCCGCCAC GACGTNGTTG AGCCATGTTT TTNCTCCTTT ATAAGTTTAG 120 
TTGTCCATTA GAATGGTAAA TCATCANCTG AAATATCCAA TGGGTTTGTT GCTCCAAATG 180 
GATTTTCATT ACGTGAAAAG TCTGGTACTG AATTTGTAGG TGCTGAATAG TTTGCAGTTG 240 
GTGCAGAGTA AGCTCCACCT GTGTGACCCT CACGCACACT ACGGCTTTCC AACATTTGGA 300 
AATTCTCAGC CACGACCTCT GTCACGTAGA CACG[illegible]TCC TTGCTGGTTA TCGTAACTAC 360 
GAGTCTGGAT ACGACCTGTC ACCCCGATAA GTGAGCCTTT TTTAGCCCAG TTAGCAAGAT 420 
TTTCAGCCTG TTGGCGCCAC ATAACGACAT TAATAAAATC AGCCTCACGT TCACCATTTT 480 
GACTCTTAAA TGTACGGTTT ACTGCAAGAG TAAAAGTCGC AACTGCTACA TTTGATGGGG 540 
TATAACGCAA CTCAGCGTCA CGTGTCATAC GCCCTACAAG TACAACATTG TTAATCATAG 600 
TTTACCTTCT TACGCGTCAA TTTTGACGAT CATGTGACGA AGAATGTCAG CGTTGATTTT 660 
GGAAAGACGG TCAAACTCTT TAGAGAGCTG CATCGNTCAT TTGCTTCAAC GTTAACGATG 720 
TGGTAAAGTC C 731 



(2) INFORMATION FOR SEQ ID NO: 52: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



CTGGGTTTTA CCACGTTTTC CTGATGTACG AGCGAGATCC TTACGGTTCA ACATAGTGTT 60 

GATAAAGCTG GATTTACCAA CATTTGAACG CCCTGCTAGG GCAATCTCTG GCAGTTCATC 120 

CTGCGGATAG TGGGACTTAT TAGCTGCACT GAGCAAGATT TCAGCATTGT GTGTATTAAG 180 

TTCCATAGTC ACCTCTAGGC TGTTTCTAGG ATCGGTTTAT CCGTTCCATC GACAGTTTCT 240 

TTAGTGATGC GAACCAATTT CACATTTTCC TGACTCGGCA CCTCAAACAT AACATCTAGC 300 

ATGGTTTCTT CGATGATGGA GCGAAGTCCA CGCGCCCCTG TCTTCCGTTC GATTGCTTTA 360 

TTAGCAATCT CTTGAAGGGC TTCGTCGTCA AATTCCAACT CAACATCATC ATAAGAAAGC 420 

AAGGTTTGGT ATTGTTTCAC CAAGGCATTT CTTGGCTCTT TCAAGATGCG AACCAAGTCC 480 

ATCAACGGAC AATTGCTCAA GAGCCGCAAA AACAGGCAAG CGTCCAATCA ACTCAGGGAT 540 

AATACCAAAT TTTTGAATGT CTTCAGCGAT GAATTCTTGC ATGTATGAGC TGTTTCTCTG 600 

TCAATCGCCT TATTGTTTTG ACCAAATCCG ATGACTTTTT CACCCAGACG TTGTTTGACA 660 

ATTTCTTCAA TACCATCAAA AGCACCACCC ACGATGAAGA GGATATTTTT TGTATCCACT 720 

TGAATCATCT ACTTGTTGTG GATGTTTGCG TCCACCTTGA GGCGGCACGC TAGCAACAGT 780 

TCCCTCAATA ATCTTGAGAA GGGCTTGTTG CACCCCTTCA CCAGAAACAT CACGTGTGAT 840 

AGACACATTC TCACTCTTCT TGGCAATCTT GTCAATTTCA TCCACATAGA TAATGCCACG 900 

CTCTGCACGT TCGATGTTAA AGTCAGCAAC CTGCAAGAGT TTGAGGAGTA TATTTTCCAC 960 

ATCCTCACCC ACATAACCAG CCTCCGTCAG AG 992 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1299 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATTTTCACAG GGCAGGATGC GGAACGACTA GCCCCATACT TTAACGGAAT TTTGGTAGGG 60 
ACAGCTCTTA TGCAGGCAGA AAATGTGGCC CAGAGAATCA AGGAGTTGCA GATTGACAAA 120 
CAGCAGGAGC 180
AAGAGGCAGC 240
TTTCACCAAG 300
AAGTTCACGG 360
CTGTGCAGGT 420
ATGCCCCTGT 480
TAGCACAGCC 540
AACATTTTAC 600
ATCATGAAAA 660
AAAGATGGAT 720
GTTTTGGAGT 780
TTAAACCAAC 840
TTGACCCAGC 900
GGAGCCCACA 960
AAGAAAATTA 1020
GCCCTCTTTA 1080
CTCAATGTGT 1140
CGCCTGCTCA 1200
ACCCACTATA 1260
TCCTTGGTTC TGCCTTGGGG CCTCATCCAT TTCCAGAAA 1299
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1078 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

CTGTTGAATC AACAACACGT ATCCAAGTAT CAGNATCAAA TTATGCAGGT AATCGTACAA 60
TTGGAAATCA CCGTGGATGG TTCAATCCAA CAACAACTTC TGAAGGTTTT GTTACATATA 120
TTTATGCAGA TTAATTTACA GAGGGACTCG AATAGAGCCC TCTTTTCAGG TTTTACCGTG 180
ACAATCCCTA TTAAAAATTA TATCAAAATA GCTTGAAAAT ATTGGAAAAG TATGGTAGAA 240
TGAAAATTGT CGTGTGAACG ATAATACTCA TTCTTGATGA ATTGTGAAGC AGTTGCCCTT 300
GGGTCGTTTT GCGAGTTGAA GTCAAGAAGA GGAAAAAAAC AAAAAGGAGA AATACTCATG 360
GCAGTAATTT CAATGAAACA ACTTCTTGAG GCTGGTGTAC ACTTTGGTCA CCAAACTCGT 420
CGCTGGAATC CTAAGATGGC TAAGTACATC TTTACTGAAC GTAACGGAAT CCACGTTATC 480
GACTTGCAAC AAACTGTAAA ATACGCTGAC CAAGCATACN ACTTCATGCG TGATGCAGCA 540
GCTAACGATG CAGTTGTATT GTTCGTTGGT ACTAAGAAAA CAAGCAGCTG ATGCAGTTGC 600
TGAAGAAGCA GTACGTTCAG GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT 660
TACAAACTGG GGAACAATCC AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTGGATG 720

GAAGAAGATG GAACTTTCGA AGTTCTTCCT AAGAAAGAAG TTGCACTTCT TAACAAACAA 780 

CGTGCGCGTC TTGAAAAATT CTTGGGCGGT ATCCGAAGAT ATGCCTCGTA CCCCAGATGT 840 

GATGTACGTA GTTGACCCAC ATAAAGAGCA AATCGCATGT TAAAGAAGCT AAAAAATTGG 900 

GAATCCCAGT TGTAGCGATG GTTGACACCA ATACTGATCC AGATGATATC GATGTAATCA 960 

TCCCAGCTAA CGATGACGCT ATCCGTGCTG TTAAATTGAT CACAGCTAAA TTGGCTGACG 1020 

CTATTATCGA AGGACGTCAA GGTGAGGATG CAGTAGCCGT TTGAAGCAGA ATTTGCAG 1078 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 



CTATGAGGGA TATAAAGTGA GAGAGTTGGG CCAACGTTCC AAAAACGTCC TAAATTCTGC 60
CCATTGACAA AGGCAACCCC CTTACCAAAC TCAGACNAGT CTAGGTAAGT ATCTTTTGGC 120
TATTCGACTG TAAAGTCATA AGCGTAAAAG GCTGGTTGTC CTTGAGTCCA TCCTTTGGAA 180
AAATCAATTT TCTCAGGATT GTCTAGTGGG AGTGGATAGT GTTTCCAGTT TAGTAAGAAA 240
TGCAGATCCT TACAGACCCC TGTCCGAATT CCCTTACGTT GCGTATCCGC TAAGAACTTA 300
TGCCCATAGT TGACACGCCC CATATTTTCT ATCAAGATAT CTAACCTAGA TAGCCCTTTC 360
TTTTTACCTT GATAAAAAAT ATCTTCCCCA ATCTCTGTCT GATATTGAGT TTTAACCCAC 420
TGACCATCGA CATACAGCTG GGCCCTATCT CGACCATCAA TGATACGAAG TCTTTCTTCT 480
TCTGCATCCC AGTTTGTTTC TGTTCGATAA AGTAGGTAGC CATAACTTTG TCCCAGCTCC 540
TCCATCTTTT GAGGATAGAG ACTTTCTACA GGGACTTGAC AAGCTATCTA AGGTTTCAAA 600
CAAAGAAACT TTTTCAACTA GTGGAATAGC ATCCAACTCC ATACTCTCTT TGTAGAGTGG 660
TTCCAACTGC GGATACTCTG AAAAATGTGT TGCCATCATC TTCTTGACTG CAAGATATTT 720
AGCAGTTGGA TTTCCTTCTT CATCCAGAAG GGCATCGTAA TCATAAGACG TTAACTTGTG 780
GCAGGTCCAA ATTCCTCGAG 800



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 769 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56:



CTCACGCAGT TCTGTTACCA AAGACCGTTC AAAAAGCAAA CTAGCAATAC TTATAACTTA 60
TTAGAAATTC CTAATTTTAC CTTGCTAAAC GAATACAGGT CACGATGCCA TCTGAGTCTG 120
TGATTTCCAA TTGGCTCGAT GTCATCCATT TGATTGGTAC GTCAACTTCT TGTGCTCGTT 180
GGGCAATCGT TAACAGTTCT TCTTTATGTG CAACTTCGAT GACATAGTAG GCTAAACCTG 240
GTAAGCCTTG CTTACGCGGA AGCCAGACCT TTTCCTTCCC CATTCGTTAA CTGCTAAATG 300
ATGATGGTAA TCTCCAGCCG CAATCCAACT AGCGCCAGGC ACACTGAATT TATTCTCTAG 360
CCCTAACACC TTTTGATAAA ACTGGCTGGA CTTTCGACTA TCCTTGACGG AAAGATGAAT 420
ATGCCCCATT CTTGTACCTT CTGCCAGGAT AAAGGGCTCT ACTCTTTCCC CCAACTCATA 480
AATGTCCTGC GCCGCAAGAG CCTCCGTCAC TCCGATAATG CGTCCATCTT CTCGAATATC 540
CCATGTGGAA ACTGGCTTAT CTCGATAGAG TTCAATGCCA TTTCCCTCCA AATCTTCCAA 600
GTAAATAGCT TCACTGTAAC CATGGTCTGC ACCGCCGACA AGAGGAATTT GCAAATCTGT 660
CAGATGTTTC AAGACATCAG CCAAAGCCTT GCGTGTGGGC AAGAGAATGG CCAAATGGTA 720
AAGACCATAA TGTTCCCTCA CTTCTCCGCT CTTCTTGTGC TTGAATCAG 769



(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 637 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



TTACCCCAAG CCTGCAACTT CCCAAGCTGA TTGGTTGAAT GATTTGTCAT CAACACCACC 60 

AGTATCAGTG ACGATTGCTG CTTTTGTCTT CACATCAGAA GATGAAGCTG CGTTACGAGA 120 

AGAGCGGTTA CCACATGCAG CAAGTCCAAC TGCTGCCACT GCAACTAGGC CAAGACCTAG 180 

CCATTGTTTC TTGTTCATTA CTGAACCTCC TAAATAAGAT GTGCAACGAT GTTGCAAGTA 240 

TGGATTGATT GGTCACAAAG ACCCTTGCCA CTCAAAGAGC GACTCAGACT AATTTAAGTC 300 

TGTAAAAGAA TATGGAAGTA ACTCCCCGAC CGTCATCTCG ACCGTCGATT TATCTTTTGC 360 

GACTAAGGTC ACTTTTAAAT CTTGTTCAAA AAATTCGACC ATTACTTGGC GACAAGCACC 420 

ACATGGTGAA ATCGGTTTTT CAGTTTGACC ATAGACAATC AATTCTGAAA ATTCTCTTTG 480 
GCCTTCAGAT ATAGCCTTAA AAATAGCTGT TCTCTCACCG CAATTGGTCC AAAGGATAGC 540

TAGCATTTTC GATATTCACT CCCGTGTTAA ACATTTCCGT CTTTGGCTAC TAAAACTGCT 600

CCGATAGGAA AGTGAGAATA GGGGACATAG GCATGTT 637



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 940 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



CTGAGGATAA TGACATCTGC CANAGGAGCA AAACTAGCAA AATCATCNGT TGCACGGTCA 60
ATCATGCCTT CTTTCAAGGC GATATCTCTC GAAGCTTGAC TACGGATTAT AACCTAAAAT 120
TTCATAATCT GGATGATCGC GTTTGATACC AAGTGCCATA GAGGCTCCAA TCAACCCAAG 180
ACCTGCGATA TAGATTGTTT TTGCCATAGG AACTCCTTAA TAGTTCTTTG TATAGTCTCG 240
GTGTTTGGCT ACCGCTTCTT TTAGTTCCTC AAGATTATCT GATGAGAATT TTTCGAGGAT 300
TTCTTGCGCC AGAACCGTTG CTACAACTGC TTCCATGACC ATTCCTGCAG CTGGAAGAGC 360
AGTCGGATCA CTTCTCTCCA CGGTTGCCTT GTAAGGTTCG TGGGTTTCGA TATCCACACT 420
CATAAGAGGT TTATAAAGAG TAGGAATGGG TTTCATGACC CCACGAACAA CGATGGGTTG 480
CCCATTAGTC ATACCACCTT CAAAACCACC TAGATTATTG GTACGGCGAG TATAACCGTC 540
TTCTTTAGAC CAGAGAATTT CATCCATAAC TTGGCTGCCT TTACGATAAC CAGCCTCAAA 600
GCCAAGACCA AATTCCACCC CTTTAAAGGC ATTGATAGAG ACAACAGCTT GAGCCAATCT 660
TGCATCCAAT TTTCTATCCC ATTGGGACAT AGGAACCAAG ACCAACTGGG AACGCCTCCG 720
ACGACTGTCT CCACAACCCC ACCGATGGTA TCACCATCAC GTTTGATTTG GTCAATATAG 780
TCCTTGATTT CCTGTTCTCG TTCTTGGTTG ACAATAGAAA ACTTCCAGAC TGGGCAGCTC 840
TTTGCTTTAA TTTCCAGCGA CTGTCAGATT TTCCAGGAAC ATCCATTCCC TTGCCACCAA 900
AGACCACGAC ATGGTTGGCA ATCTCCATAT CCAGATTCGA 940



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 



GATTCTACAG GATCCATTTT ACTATCTTAC GCGCCGGGAA GTAGGCTGAG ACATAACCAA 60
GTAATAGAGC CAACAACTAG AGTTCCTAAA ACAGATAAAA GATTTAATTC AAAAACCTTA 120
GTGATGGATG GGTAAAAGTG ACTTACAATC GCATTCGCCA AACTTCCCAC CCCTTGTGCA 180
ACCAAAAATG CCAGCAGCAA GGCGATGCCT ACAATCCAGA TAGCCTCGTA AATAAAAATT 240
CCTTTGACAT CACGATTCTG ATAACCAACT GCTTTCATGA CACCTATTTC CTTGGAACGT 300
TGCATGATAT TGATGTAAAT AATGATACCA ATCATAACCG CTGCTACCAC AATAGCTTGT 360
GATGAAAGCA CAATCAATAA TCCCTGAATA ACACGAATAA AGGTAATCAC AATATCAAGA 420
ACTCTCTGTT AAGAAAGCAC AGTATACTTC TTATTTTTCT GTAATTCTTC TGTTACTACT 480
TTTGTCTGTG ATGGATCTTT GAGTTCCAAG ATAAAATAAG ATACAGCTTT CGTAAATCCA 540
GCCTCTTTCA AAATCGTTTC CATTTGATGA GACAGCATGA AACTGTTGCT GTCCTCCATG 600
TCATCTTCAT CATTGATTAC ACGTACAATC TTCGTTTGAA ATTGAGCAAT CTTACTAGTT 660
TCGGCAGCAC TTTCTACAAT GCTGACTGAG ACTGATTTGC CAATAAGATC ATTAGCTGTC 720
AAATTTTTTC CTGTCTGTTC ATTCCAATTT TTTAGTAAAG CTGCTTGGAA TCGTTAATCC 780
CTGTTCATTT GTATCAGTAT AGAGGGATCC AGCCAACACT TTGTCCGTCT CATTATTACT 840
AACAGAGATA CTTGTATCAT CATAAAGACT CACTACTTGA GCATAAGAAG CATCGTTTGA 900
CTCAAATCCA TTTCTTGCCC ATCTTTTCTT GCCCATCTAT AGTAATATTT GACATGTTCA 960
TCCCNAAAGG ACTCTCCAAA TATNNNATAG 990



(2) INFORMATION FOR SEQ ID NO: 60: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



ATCGGCTAAG ATGCGTGGAG CATCACAAAT TATCCTTATG AGCCGTCATG AAGACGTCAA 60 

AAGATGGCTA TGGAGTCAGG TGCGACAGCT GTTGTTGCAG AACGTGGTCA AGAACGAATT 120 

ACCAAGGTGC GTGAAATCCT CGGCGGAGGA GCAGACGCAG CACTTGAATG TGTTGGTACG 180 

GAGGCTGCTA TAGAACAGGC GCTAGGTGTT CTTCATAATG GAGGGCGTAT GGGCTTTGTA 240 

GGAGTCCCAC ACTATAATAA TCGTGCTCTT GGTTCGACAT TTATGCAAAA TATCTCTGTA 300

GCAGGTGGGG CAGCTTCTGC TACAACATAC GATAAGCAAT TTTTACTAAA AGCCGTCCTT 360 

GATGGTGATA TCAATCCAGG TCGCGTCTTT ACTTCAAGTT ATAAACTGGA AGATATCGAC 420 

CAAGCCTACA AAGATATGGA TGAACGTAAG ACCATTAAGT CTATGATTGT AATCGAATAA 480 

AAAACGAATA GGAGTTTTAG AACTCTATTC GTTTTTTATG TTATCCTATT CTTGATTTAA 540 

GGTACTTTCT CTTAATGTCA GTCTGGTTCC CAGCATGGTC AGGCTAGGGA TTTTCCGACC 600 

GTGGAGGACT TCCTTGTTAA GAATATCCAT ACCTGCTCGG CCCATTTCTT CAGTATAAAC 660 

TGTAATACTA AGAGAGGGGA GGATAGACCT GTTTGGTCAG ACTAAGTGTC GTTAAAAGGA 720 

AAATGAGGCT GACGCGATCT GGCAGGCTGA TTCCAGCTTC TTGGAGGGCA CGGAGGGCAC 780 

CGATAGCTAA ACTATCGCTG GCTGCGAAAA ATGCTGGCGG AAGTTGGTCT CCCAAGCTCT 840

GAATGGCCTC CTTCATTAAG TCATAGCCAG ACTGGGCAGT AAATCTTCCT TGAAAGACCA 900 

GTTCATCATG ATAGATTCCC CTCGCTTGAC TGTAGTTTTT GAAGTTTTCT AGACGCTTGT 960 

CCTGAATGAT TTCTTCTTGG TCTGTAGTTT CTTCAAGGCC TGTTAGAATC CCGATACGGT 1020 

CCATTCCTTG ACTGAGGAAA TAATCGACAA CCTGTTTCAT AGCAGTGTAA AAATCCGTGA 1080 

TAATACAGGT ATGTCCCAGG GAAAGTGTAT CGCTGTCTAG AAATACAAGA GGCTTTTGGT 1140 

ATTCTTCAAA GGCAGAAATC TGAGCTTCTT GATAGGATTG ATTCTGCTAG GGATAACAGC 1200 
CAAAGGCAAT CCTCCCTTTA TCCACCTCAT ATTTGAAACC ATTTCAGCTC TTA 1253



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1211 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



CGTGTACCAT CAAGAGGCTA ATTGATGTTT CTTCCCCCTG TGCCTCTCTT GTCAACATTT 60
CTGTTTGAGC CTTATGCGCC TGTAATAAGG CAGCATTCGC ATCAGCTAAT CTTCGATTGG 120
CTTTAGAGAA ATCCCCTTTT TTTGCTGCCT GAATAGCCTC CATAGCATTA CTTTTGGCTT 180
CCCCACCATA CATAATTAGC CCCATTACAG ATTCTAAATT ACTTTCATCC ATAACATTCT 240
CCTCAATCTT AGTCTAATAA ACTTTCAGCT AAATCTAAGA CTTTAGAACC ATTCATCATT 300
CCATAATCTG TCATCGGAAT AACCGCCACA GGAATCTGTC TATCTTTTAG TTTTTCTTGA 360
AAGTCCCCTA GTAAATAGCG AACTTGAGGA CCTAAAAGCA ACACATTTAC TTCTTTTGTT 420
GCTACAATTT CTTCTGCTTC AGGAGCTGGA ACTGCAAAAA TTTCTGCATC CAACCCCTTA 480
TCTTCTGCTG CCTTTTGCAT CTTTGTCACT AGCAGACTTG TACTCATACC TGCTGCACAT 540
GCTAACATAA TTGTTACTTT AGCCATAGCT TACTCCTTTA TCACCTATTT ACTGACCTAG 600
CCAAGCGAGA TACTGTATGT CTCACTCCTC TAATTGCCTT GCTCAATGCA TAAATATTAT 660
CGACTGTTGC CAGTCCCCCA TATCCTGCGT CACCTATATG TTGAATATCA ACTCCACAAA 720
TTTTATTTCT AAGTGCAATT TCCTTAATAG TATCTGTATC AGATGTTTCT TGGCTAGTAC 780
CAATAGCACT CAATACAAGA CCACCCTTAC TATGAACGAG ATCAACGACT TCACGCAACT 840
CTTGGTCATG AAAGGCTGGA ACGGTTCCAA CTGCTGGAAC AAGTATCACA TCCGCACCTG 900
CTTCCAACAA TTGCTCTGCG ACAGAAAGCT CTGCCACAGG CTCATTCACT CCTGCCCCGT 960
GCATCTTACC TGCAATAATC AAACCAGAAA AGTTTTCCTT AGCAGTTTGA ACAGCCTTAA 1020
TGATTTCTCG ATTGCTAACT CCTGTTCCAG GATTTCCAGT CAAACAGACA AAGTCAAAAC 1080
CTAACTCCTA TATTCGCTTC AATGTTTCAA CACTAGCAAC ACGACCTGCA ACAATTTCCT 1140
GTGTTTCCTC TAGCATTCTT TGCAGATGGG TCAATCGGTT CCAAATTTAC ACCAATTGGA 1200
CAAGCAACAA G 1211



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

CCACACTAGC AGCCTATGGA CTCACTAAAA GAAAAGAAGA CTAAGTCTTT TCGATAAAAA 
ATAAACAGCG AGATTGAAGC TCGCTGTTTA TTTTTTAATT AATCACCTAG TCCAAGACGT 
TCAAAGATAT CATCCACTCG TTTGGTGTAA TAAACTGGGT TGAAGATTTC ATCGATTTCT 
TCTTGTGTGA GACGTGATGT TACTTCTGAA TCTGCCTCAA GAAGTGGTTT AAAGTCTACT 
TGGTTGTCCC AAGAGTAGGC TGTTTTTCGT TGCACCAAGT CATAGGCTTG CTCACGGGTC 
ATGCCTTTTT CAATCAATGT CAACATAGCC CGTTGGCTAA AGATAAGACC AAAAGTCGAG 
TTCATGTTTC GGATCATATT TTCTGGGAAG ACTGTCAAGT TCTTGACGAT ATTTCCAAAA 
CGGTTGAGCA TGTAGTCAAT CAAAATGGTC GTATCTGGTG TGATGATACG CTCAGCTGAT 
GAGTGAGAAA TATCGCGTTC GTGCCAGAGA GCGACGTTTT CATAAGCCGT AATCATGTGA 
CCACGAATGA CACGCGCCAG ACCAGTCATA TTTTCAGAAC CGATTGGGTT GCGTTTGTGA 
GGCATTGCTG AAGACCCTTT TTGCCCTTTA GCAAAGAACT CTTCTACTTC GCGTTGCTCA 
GATTTTTGTA GACCACGAAT CTCAGTCGCC ATACGTTCGA TTGAAGTCGC AATGCTGGCA 
AGAACCGCAA AGTACTCAGC GTGAAGGTCA CGAGGAAGGA CTTGTGTTAA AGATTCCTTG 
GGCACGGATG CCAAGTTTAT CGCAGACATA CTCCTCTACA AATGGTGGGA TATTGGCAAA 
GTTCCCAACC GCACCAGAAA TCTTACCAGC TTCTACACCA GCAGCCGCAT GCTCGAAGCG 
CTCCGATATT GCGTTTCATT TCGCTGTACC AAGTTGCTAA TTTAAGGACC AAAGGTTGTC 
GGCTCAGCGT GCACACCATG AGTTACGCCC CATCATGATG GTGAACTTGT GCTCCTTGGC 
TTGTCAGCGA TGATATTAGT GAAGTTTTCA AGGTCACGAC GGATGATGTC GTTGGCCTGC 
TTGTAGAGGT AACCATAAGC AGTATCCACC ACGTCGGTAG AAGTTAACCC ATAGTGAACC 
CACTTGCGCT CTTCACCAAG AGTCTCAGAA ACCGCACGCG TGAAAGCCAC CACATCGTGG 
CGCGTCTCCT GCTCAATTTC CAAAATACGG TCGATGTCAA AGTCCGCCTT CTTGCGAATC 
AAAGCCACAT CTTCCTTAGG GATTTCCCCA ACTCAGCCCA TGCCTCGTCA GAGAGGATTT 
CCACCTCAAG CCAAGCACGG TATTTATTTT CTTCACTCCA AATATTCGCC ATCTCAGGGC 
GAGAGTAACG GTTGATCATG TGTTAATTTT TCCTTTCTTC TTAAG 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

CTATTCTGAG TATAAAGTTG GCGGAGGCTG GAACTACGCT CGTTATGAGG TCATAAACTA 60
CTATACTGGA GGTTATTAAT CCTTAAAGAG TGAGAAAAGG AGGGCTGGAT ATGTTAAATC 120
TTACTCATGT TACCTTAAAA ACGCGACAAG TCATCTTGCA AGATGCGGAT TTTACCTTTA 180
AAAAGGGTAG GATTTATGGC CTTCTTGCTA TCAATGGCTC GGGAAAGACG ACACTATTCC 240
GAGCTATGAG CAAGTTGCTT CCCCTTAGTA GTGGACACAT CGCAGTTCCT CCTTCTTTGT 300
TTTATTATGA GAGCGTTGAA TGGCTGGATG UAAAL I 1 AA<J TGGGATGGAC TACCTTCGTC 360
TCATAAAAAA ACATCTGGAA GTCAGACCTA AAv. 1 iVAoAu A TV^ AAA mp/^P A l\xAAATCGC CTACTGGGAA 420
ATGGCTGACT ATATCAGTCT TCCCATCCGC AAG 1 A 1 1 L.V- X 1 AubbA I VjAA IT* A A ^^fTfTV* GCAACGC 1 1 vj 480
GTGATTGCTA TGTATTTTCT CAGTCAGGCC AAA 1 G\- 1 VjvjU 1 LA I GGATGA GATTACAAAT 540
GGCTTAGACG AGTATTATAG ACAGAAGTTT TTTGA 1 AGGL I AGCAC AAAT CGATAGACAA 600
GAACAGCTGG TTCTTTTAAG TTCCCACTAT AAGGAAGAG 1 1\JG 1 TGATAT CTGCGATAGA 660
GTAGTAACCA TTCATCAGGG GCAGATAGAA GAGGTTTAGT TTATGAAAGA TGTTAGTCTA 720
TTTTTATTGA AAAAAGTTTT CAAAAGTCGT TTAAACTGGA TTATCTTACT TTTATTTGCA 780
TCTGTACTCG GTGTTACCTT TTATTTAAAT AGTCAGACTG CAAACTCAGT CAGCTTGGAG 840
AGAGAGTTGG AAACTCGTCT TGTAGACCGT GAGAGAGTCA TCAATGAAAA TGAAGAGAAA 900
CTCTCCCAAA TGTCTGATAC CAGCTCGGAG GAATACCAGT TTGCTAAAAA TAATTTAGAC 960
GTGCAAAAAA ATCTTTTGAC GCGAAAGACA GAAATTCTGA CTTTATTAAA AGAAGGGCGC 1020
TGGAAAGAGG CCTACTATCT GCAGTGGCAA GATGAAGAGA AGAATTATGA ATTTGTATCA 1080
AATGACCCGA CTGCTAGCTC TGGCTTAAAA ATGGGGGTTG ACCG 1124
(2) INFORMATION FOR SEQ ID NO: 64:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 



CAGAGCAACT TCGTCAAATC TTCATAGGTG AGGTAACCAA TTGGAAAGAG GTTGGTGGTA 60
AGGACTNACC CATCTCTGTT ATCAATCGGG CAGCCGGCTC TGGCTCTCGT GCTACCTTTG 120
ATACTGTCAT TATGGAAGGT CAGTCTGCCA TGCAAAGTCA GGAGCAGGAT TCAAATGGAG 180
CGGTAAAATC AATCGTATCA AACAAGTCCA GGAGCTATCT CTTATTTATC TCTTACCTAT 240
ATAGATGATT CGGTCAAAAG CATGAAGTTG AATGGCTATG ACTTAAGTCC AGAAAATATA 300
AGTAGCAATA ATTGACCCTT GTGGTCTTAT GAGCATATGT ATACATTGGG GCAGCCCAAT 360
GAGTTGGCTG CAGAATTTCT CAATTTTGTT CTCTCGGATG AGACCCAAGA AGGGATTGTC 420
AAAGGATTGA AGTATATTCC GATTAAGGAA ATGAAGGTTG AAAAAGATGC TGCCGGAACT 480
GTGACAGTGT TGGAAGGGAG ACAATAATGA ATCAAGAAGA ATTAGCTAAG AAAATGTTGC 540
TTCCATCAAA GAATTCTCGT CTGGAGAAAT TAGGAAAAGG TTTGACCTTT GCCTGTCTTT 600
CTTTGATAGT CATCCTTGTG GCCATGATTT TGGTTTTCGT AGCGCAAAAA GGCTTGTCGA 660
CCTTCTTTGT CAATGGTGTG AATATCTTTG ACTTTCTTTT GGGAGGAACT TGGAATCCTT 720
CTAGTAAAGA ATTTGGTGCC CTTCCTATGA TTTTGGGTTC CTTTATCGTT ACCATTCTCT 780
CAGCCCTTAT CGCAACACCC TTTGCTATTG GTGCAGCAGT TTTTATGACC GAAGTATCAC 840
CAAAAGGGGC GAAGATTTTG CAACCAGCTA TTGAACTCCT GGTTGGGATT CCTTCAGTAG 900
TGTACGGATT TATTGGCTTG CAAGTCGTCG TTCCCTTTGT TCGCAGTGTC TTTGGTGGGA 960
CTGGTTTTGG GATTTTGTCA GGGATTTCCG TCCTCTTTGT CATGATTTTG CCGACCGTAA 1020
CCTTTATGAC AACGGATAGC TTGCGTGCGG TTCCTCCNTT ATTATCGTGA AGCCAGTTTC 1080
GCTATGGGA 1089

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1502 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



CTNTNTTTAA ACCTTGACCT TANTTGANNA TTATACCGAA ATTNTCTTCA TTTTTAAAAG 60
AAAAAAGGGC GCTGGTAAAA GGGATAATCT TCACCAACTC CCTATNTTTC TACTTATCTA 120
AGCCTAATTC TGCCAAGATT TGACGTTTGT NGGCAATCTT TNGGACTTCN TNGTCCTCAT 180
CTTCAGACCA GTCTAGTTTA ATTTCTGAAA CAATCTGCCC AGGGNGATTT NTCAAGATAT 240
AGATGCGGTC GCTGAGANTG AGGGCCTCCT CAATACTATG CGTGANGATC AGGGTTGTTA 300
GCTGCAACTG CTTGTGAATC TCAAGATACC AAGCGTGGAG TTCCATCTTT GTCATCTCAT 360
CCAAGGCGCT AAAGGCCTCA TCTAAGAGAA AGAGCTTGTG CCCAAAAAGG TAGGTCCGGA 420
GTAAGGCTAC ACGCTGGCGC ATCCCACCGC TAAGTTCATG AGGATACTTG TCTCTTACAG 480
CTGTCAGCTG GAAGGTCGCA AGAATTTTAT CCGCTCGGGA AATAGCTTCT GCCTTATCCA 540
CCTTTTGAAT CAAGAGGGGC AGAATGATAT TTCCAAGCAC CGTCTTGTGC TCCAAGAGCA 600
GATCCTTTTG CAACATATAA CTCACGTGCC CCTTGGGATT TTCTTCACCA TCAAGGACAA 660
TTCTCCCTGA CTGAACTTCT AAAATCCCAG CGATTAGATT AAAGAGGGTG GTCTTTCCAA 720
CACCACTTGG GCCTAGGATG GAAACCACTT CGCCTGAAGT CACCTGTAGG TTGATATCCT 780
CTAAAATCCT CTCCTGACCA TAGGCATAAC TGACGTGCTC TAGTCTAATT TCTGTCATTA 840
TTTCACAAAT TCGTTGGTGA AGCCTTTGTC TGTCAAGTCT TCTTTAAGGA TACCATTTTC 900
TTTATCCCAT TTATAGAAAG CATTCCAGCG AGCTGCGTCA AATTGACCCC ATTTTTCCTT 960
GTCGCTTGCG TATTCTTTTG ACAAGTATTT TTGAGATTCG ATGACAAAGT CACGTTTTTC 1020
CTTGAGTTCA GGTGCATTCT TGATGAGAAT ATCTGCAGCT TCTNCTGGAT GTTCCATGGC 1080
ATATTGGTAG CCTTTTNTGA TGGCTTGGAT GACTTTGCGA GCTTCTTCTT TGTTATCTTT 1140
CAGATAGTCG TTGTTTGCGA TGATAACTGG TGAATATTAN TCAAACTCCT TGACATAGTC 1200
TTTCAANTAC ATGAAGTTAG CATCTACACC TTGAGATTTA GCAAGGATAC CATCCCAACC 1260
GTAGTAAATC CNAGCAGTAT CAAAGACGCC ATTGGCAATC GGTGTGATTG AGTTTGAGTC 1320
NTTANTTGGT ACTNTTTCAA CCTTCTCACA GTCTCCNCCT TGAGATTCTA CCAAGGTTTT 1380
CAACATACNN GTTCANTTGG GTCATTCCAT GTCCCATATT TCTTACCAAC CAAGTCTTTT 1440
GGACTGCCTA CATTNTCACA TTACCGAGAG ATGATCCCTG ATGTATTGTG TTCCACCNAT 1500
AG 1502
(2) INFORMATION FOR SEQ ID NO: 66:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 



ACTCTTCAGC GATTTCTTGG CTGAGAATGT CCTCATAGGC AGGGAGTTGG ATGTCCTGAC 60
CATATTTCTT CTTGAGGGCA GTAGTAACTA GGCGATAAGC CAGATTGGCA TTACGAGAGA 120
AGATAGGCCC GNGGAANTAC GAACCAAAGA CATTCTTATA ATGAACCCCT TCACCGACCT 180
TTTCTTCGTT GTTTCCATTT CCATAGACAA CCTGTCCCAG CGGTTTTTGG TCATCAGAGA 240
GGAAGGTACG ACCTTGGTGA TTTTCAAATC CATAGTAGGT TTCATCGAAA TCTTTCATTG 300
TGAATCTTGA TGTCACCGAT AAAACGGTTA TTGGTCTGGT TGAGCGTGTA GTGTCCCATG 360
ACCCCTAGCC CTTCGATACG TTTTCCTGAA GCTTCAACAT AATATTGACC CAATAGTTGG 420
AAACCACCGC AGATAGCCAG AACTACACCG TCGTTTTGGA TGTAGTTGTC AATGCTCTCT 480
TTTTTAGCAG GTAGGTCGTC TGCAATGATA CTTTGTTCAA AGTCTTGACC ACCACCGAAA 540
AAGGCGATGT CGTAGTGATT TTCATCAAAG TCATCATGGA GAGAAACGAT GTCAACGGTC 600
ACATGGGTTC CCAGTTTTTC AGCCACATAC TTGAGCATGA GGATGTTTCC ATTGTCCCCG 660
TAGGTATTCA TGAGATTTCC GTAGAGGTGG GCAATGTTGA GCTGATAGGG GTAATTGCCA 720
TCTTTTGAGG AAAGTGAAGT ATAAACCATT AGTTCATCTC CTTTCTAACA ATCTGACGAC 780
TAGCCAGCAG TTCACGAAAT TCCAGCATGG CAGTATAAGT TGCCAGAATA TAGGCATGCT 840
TGCAGTCTTG ATTCTCAATG GTCTTGAGAA CTTGCTCCAG ATTACTCGTT TCAGTGATTT 900
TCTCAGCTGG ATAGCCAGTC ACTCGGAGGC GACGAGCGAT TTCAGAATGA CGAACACCGC 960
CAGCGTTGAT TTCAGGAATG TCCATGTCAG TGATTTGCTC AAAGTCTGCA TCCCAGATCC 1020
AG 1022
(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
CCTCCAGGTC TCGGGAAACG ACCATGGCCT TTGTTATTGC CAACGAACTG GGAGTCAATC 60
TTAAGCAGAC GTCGGGTCCA GTCATTGAAA AAGCCGGAGA TCTGGTAGCT ATTTTGAATG 120
AGTTAGATCC TGGGGATGTA CTTTTTATTG ATGAGATCCA TCGTTTGCCA ATGTCAGTGG 180
AAGAGGTGCT TTATAGTGCT ATGGAGGACT TCTACATAGA TATTATGATT GGGGCTGGTG 240
AGGGTAGTCG TAGTGTTCAT TTGGAGTTAC CACCTTTTAC CTTGATTGGT GCGACGACTC 300
GGGCTGGTAT GCTCTCCAAT CCGCTACGGG CACGTTTTGG GATTACAGGC CATATGGAGT 360
ATTATGCCCA TGCTGACTTG ACAGAAATTG TCGAGCGGAC GGCAGATATT TTTGAGATGG 420
AAATCACTCA TGAGGCAGCA TCTGAGTTGG CCCTACGTAG TCGTGGGACC CCTCGTATTG 480
CCAATCGTCT CCTCAAGCGC GTGCGCGATT TTGCCCAGAT AATGGGGAAT GGGGTAATTG 540
ATGATATTAT TACCGATAAG GCTTTGACTA TGCTGGATGT TGACCATGAA GGTTTGGACT 600
ATGTGGATCA AAAAATCCTT CGTACCATGA TTGAGATGTA CAGTGGAGGA CCTGTTGGTC 660
TAGGAACTCT TTCTGTTGAA TATCGCCGAA GAACGTGAGA CAGTTGAAGA TATGTATGAG 720
CCTTACTTTG ATTCAAAAAG GTTTTATCCA TGCGGACACC GTCCTGGACG GGTGGCGACT 780
GCCTAAGGCA TATGAGCACT TAGGTTATGA ATACAGTGAA AAATAAGCAA GAAATCCTTA 840
GAGGCTTTTT AGAGAAAATC CAGATATGAT GGCCATTCTG ACGATCATCC GAGACCTTGG 900
TCTGAAAGAC TCGTGGTTGG CAGCAGGTTC TGTCAGAAAT TTCATTTGGA ATCTTTTGTC 960
AGACAAATCC CCTTTTGATC ATGAAACAGA TATAGATGTT GATTTTCTTT GATCCCAGAT 1020
TTTTCTTATG AGGAAACCTT ATTACTGGAG AAAAAGCTGT GAGAGGATTT TCCTCAGTAC 1080
CAGTGGGAAT TGAAAAATCA GGTCTATATG CATCAGCACA GTCCTCACAC TGCTTCCTAT 1140
ACCAGTTCTC GTGATGCTAT GAGTAAGTAT CCAGAACGGT GTACGACAGT TGGACTGCGC 1200
TTGAATGAAG AATCAGATTT TGAACTCTAT GCACCTTATG GTTTGGAGGA TATTTTGAAT 1260
TTCAAGTTCG TCCAACTCCT CATTTCTTAG AAAATGAAGA CCGAATGGAA CTCTATCAAA 1320
CACGTTTATC CAAGAAAAAT TGGCAGGAGA AAT 1353



(2) INFORMATION FOR SEQ ID NO: 68: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:



CTCCTGGAAA TTCTGGCAAG TAGGCTGGAT GAATGTTGAC AATCCGACCT TCATAAGCCG 
ACAATAAGGT TGGTCCAACG ATTTTCATGT AGCCTGCTAG GCAAACCAAG TCAATCTGGT 
GTTCTTCCAA GAGTTCGACA AGGGCTGCTT CGTAGTCTGT CTTGCTCTCA AACTCCTTGA 
GTTCAAAAGC ATAGGACAGA ACGCCGAGCT GCTTTGCACG CTCAAGCACA TAGGCATCAC 
GATGGTCTGA AAAGACAAAC TCCACTGGAA ATTCTTCGGC AATCACCTGA AAATTTGAGC 
CATTACCAGA GGCAAAAACC GCTATTTTTT TCATTTGATA ATGACACTTT CGTTTTCTTT 
CTTGACGATG CGACCAATTT CATAGACTGC TTCATCCAAC AATTCTTTTA CACGCTCTAC 
ATTTTCAGGG CTGACCGCCA ACATAAGTCC CACACCCATA TTGAAGATTT CAAACATTTC 
TTCGTGTTTA ATCTGACCAT ATTTTTCAAG GGCTTTGAAA ATTGGAAGCA CTGGAACTTT 
GCTTTCATCA ATTTCAGCAG 560



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 860 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC TAGTTCGGAA ATTGTTGCTA 60
GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT AGGAGGGATA GCAGGTATTA 120
TTGGTGCTAG GAAAAAAGAG ACCAATAATA TTGTTCCTGG TGTTGCTATT GCAACCGCCT 180
TGATGCCTCC TCTTTGTACA GTAGGTTATG CTATTGCTTC TGCTAATCTA AAATTTATCA 240
TTAGGCTCCT CTTACCTATT CCTCATCAAT TGTAGCTTTA TTGTCATTGC GACTTATATA 300
GGTGTTAGGT TGATGATGGT TAAGAAACAT TATTTTAAAG ATAATGAAGA AGACTCTAAA 360
ATGCGTAGGA TTTTGCTTCT AGTTGCTGTT TTGCTGATGA TTCCGAGTTT CATCTCTGCA 420
ACGACTTTAG TGAGAGAAAC GTTGAAAAAA GAGTCCCTTA AGAAATTTAT ATCAGAGCAG 480
TTTCAGGGGC ATAATATTTT GAAAAAAACC TATTCTAAAA AGACTCATAC CCTAAAGCTA 540
ACCATTTCAG GAAATTATTT GACAGAAGAA GAACTCGATA TGATTTCAAG TAAGAGAGGT 600
GACTATGGTT TAAGTGATGT TTCTGTTCAA GTTTCACAAT TGTCTGATTC AGAACAACTT 660
AGTAAGGAAG AACTGGTGGA GTATTTCTTC CAGTATATCA AGGATAAGGA AGCAAAAGAA 720
AAGGAAAAAG CTAATAAGTT TTATACAGAG TCTGAGGAGC AATAATTTCT TGAGAATAGC 780
TGGTTTTTCT CGTGAGTCTT CTATGTATAT CAAAGGAAGA CTGAGGTTTT AAGTATGAAA 840
CTTTTCTTCT ATTATAGTAG 860



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

TCTTGGCCAA CTGCATGGAG TTCAGCGGTC AATTTCAACG CACCTGAGAA ACAGACCCCT 60 
GCACCCCTGA AATCTCAGGA GACATGATGG TCTGGATGGA ATCAATAATG AGAAAGTCTG 120 

GCTGGATACG CTACCACTTC TGCACGAACA CTCTGCATAT TGGTCTCTGC ATAGAGATAA 180

AACTCACTAT CAAAATCACC TAAGCGCTCT GCACGTAGTT TAATCTGCTG GGCAGACTCC 240 

TCCCCACTGA CATAGAGAAC TGTCCCCACT TGGGACAACT GGGTTGAGAC TTGTAGGAGA 300 

AGAGTTGATT TCCCAATCCC AGGATCCCCA CCGATGAGGA CGAGACTTTC CTGGTACAAC 360 

TCCGCCTCCA AGCACACGGT TGAATTCCTC CATCTCCGTC TTGGTTCGAT TGACATTGAT 420 

GGAAGTCACC TCAGCTAGTT TCATGGGCTT GGTTTTCTCA CCTGTCAAGG ACACACGCGC 480 

ATTCTTGACC TCGGCAACCT CAACCTCTTC CACAAAAGAA GACCAAGACC CACAGTTGGG 540 

GCAACGTCCC AGATATTTAG GGGAATTATA CCCACAATTT TGACATACAA ATGTCGCTTT 600 

TTTCTTTGCG ATGACAAACC TCTTTCTATA TCTCTAACTC ACACTCAATC ACTTGGCAAA 660 

AATCAATCTT CTCATTTGGC ACAAACTGGC GCATGAGCAT TCGATGAGCA ACAACTACCA 720 

CAGTCTGATG TTCTCGATAC TTAAACATAC ATTCTAGAAA CCGAGACTTC ATTTCCGTAG 780 

CTGTCTCATA TTGAATAGGA CTATTAGGAA GCAACTCCCC CTTGTTTTCT AAAAACAGTC 840 

TTCTAGCTGT TTCAAAGTTT TCTATTCCTG TTTTATAGAC CTGCCATTCA TGTAATAAAG 900 
GCTCTACTCT TAAAGGAAGA CCCGT 925



(2) INFORMATION FOR SEQ ID NO: 71:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

TTCTNGATCA AGTCAAGGAC TGCANGCCAC GACCGACATA GCGTTTGGCA ATGGAAACAA 60
CCAAACGAAG ATTGGCTTCC GCAAGACGTT GTTTGGCTTC GATATCGCCA GCTTCAACAG 120
CCAGTGCCAA CTCTTTCTCC TCTTCATTGG TCAAGAGAGG AACGACCCCT ATTTCTTTCA 180
AGTACATACG GACAGGGTCA TTGACCTTAG CAGAAGTTGA CCCAATCAAG TCCTCATCGC 240
TGAGTTCTGG TTCTTCTTCA TTGCTAAGAA CACGCGCACT TGGATTTCCT TCGTTATCTG 300
TGATAGAAAT GCCTGCATCC TGAATCCGTT GCAAGAGATC TTCAATCCCA TCAGCGTCCA 360
AGGTAAAAGG AATAACCAGA CTTGCATTGA TTTCATCATC TGTTGCTGTC CCTTTTTGCT 420
TATGATTACG GATAAATTCT GCTACCTGTA CGTCAAATGT TGTTACTTCT TTTTGTTTTG 480
TTGCCATTAT TACTCCATTC TTCTCTTTTG GGAAATTAAA CGTTCCAATT CTTCTAGGGC 540
TGTATCTGTA TCTCCTACAT GGCTAGCTTC CTGCACCTTC TTTTTGATTC CCATATTGTC 600
CTGATTCAAG AGAGCCTTGT TTCGAAGTCA 630

(2) INFORMATION FOR SEQ ID NO: 72:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 



CTAGGGCCAT CTTCTCACCT CCTCCTTGTC AGTACATTCT TGCAATAGAA AAAATAAGAT 60
TGAGTCCCCC CAACCTTAAA TTTTTTCACC ATCTTCTTTT TCTTTAGCAA TTTGCTCTTT 120
GATTTTCTTT TCTTCTTCTT CTTTGCGGCG TTTTTCTTCT TCGATACGGC GACGCACTGC 180
TTCACGTTTT CCTTCTGGAT CTGGGTGAAT TGTAACGTTT CCTGATTCGA TTTCTTCTAA 240
AGCGCGAAGA GTTGATTTTT CAGACTTGAA ACCTTGAGTT GCTGGGGCAC CTGCTTCCAA 300
TTCGTGGGCA CGTTTTGCTT CCAAGATTAC GAGTGAATAT TTTGAAGGAA CCTTGTCGAG 360
CAAGGTATCA ATAGAGGGTT TTAACATCAT TTGCTTGTAC CTATTTTCTA AATTTTATCG 420
GGTAGTTGGA GATTTTGGTA ACATCTCCTG ATAGTGACCA ATGACACGAT CCACACAGAA 480
GTGTTCTGCT TCAATCACAC ATTTGACACG TTCAGCAGCT AGGGGTACCT GATCGTTGAC 540
AATCGCATAA TCATACTCAC GCATGAGGGC AATTTCTTCC TTGGCCTTTT CGATTCGTTG 600
GGCAATCACT TCTGCACTAT CTGTTCCACG ACCTACCAAG CGTTCTTGCA ATTCATCCAA 660
ATCTGGTGGT GTCAGGAAGA TAAAGACAGC ATCTGGAACC TTTTTCTTGA CCTGAAGAGC 720
ACCCTGAACT TCAATTTCAA GGAAAACATC GATTCCCTTG TCCAAGGTTT CATTGACATA 780
GGTCAGAGGA GTTCCATAGT AGTTGCCGAC ATATTCTGCG TATTCCAACA TCTGTCCTTG 840
ACGAATCAG 849



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 581 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 



GCCAAACAAC GCATCCATAA TGGTTAAATC TTGATAAGGC AGAGCTACAT TCAAAACAGC 60
TTGCTGGTTT GTAGGCTTTC AATCAGAGCA ATCACTTCTT CAACCTTGTC AGCATCAAGA 120
GCAGCTGTCT CAATCTTTGT ACTTGTTTTG CCTTCCAGTT TAGCTTTCAA GTCATCGCAT 180
TTTGACTTGG TACGGCTAGC AATCATAATC TCTGTAAATG TTTCGCTATC TTGACAAATC 240
TTTGAAATAG CAACTTGGGC AACGCCCCCA CAACCAATAA CTAGTAAACG ACTCATTTTT 300
TTCCTTCCTC TTCTTCTAAA ATGTCCTCAA CATACTTGGG CAACATAAAG GCTCCCACGT 360
GTAAGTTTGC AGTGTAGTAT TCTGTGAAAA GCTGGCGTTT TTTCCAGCCT TCCTTGTCAA 420
AATCTTTGAC AGGGTGGTAT TTTTTCGATG CAAATCCAAA CAACCAATAG CCAGCTGGGC 480
TAGTTGGAAT ATGGGCCTGA TAAACCCGAC TGATTGGAAA GGCTTGATTG ACCTTGCGGT 540
GCATGCTTCG GCAGGCCGAC TCATCCTCGT CAAAGAAAGG A 581



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:








CCATACTTCT 


ACCAAGGAGA 


TACGCTAGCG 


GTCTCGGAAG 


TTCTCATACT 


60 


1\- LAI LUC 1 A 


TATCTGGCCA 


TCGCCTTTCT 


ATCCAAGGAT 


TGACAACAAG 


AGATAAGGCA 


120 


GAGAAAAAAT 


TCTTGCTGGA 


TCAGCTGGTT 


L»Cv_l GCGATG 


GTGGTACAGG 


TGTCATGCAC 


180 


GAAAGCTTCC 


ATGTAGATGA 


TCCGACCCTC 


TACTCTCGTG 


AATGGTTCTC 


CTGGGCTAAC 


240 


ATGATGTTCT 


GTGAGTTGGT 


CTTGGATTAC 


TTGGATATTC 


GCTAAGGGGC 


TCGCTTTAGC 


300 


TCAACCGATT 


CTTATCAGAA 


TCACAAGTTT 


ACATTTAAAA 


CGTTAAAATT 


TAAATTTAGA 


360 


ATGAGGTTTT 


ACTTCATGGA 


AAATGTTGTT 


GTACATATTA 


TCTCACATAG 


CCACTGGGAT 


420 


CGTGAGTGGT 


ACTTGCCTTT 


TGAAAGCCAT 


CGTATGCAAT 


TGGTGGAATT 


GTTTGACAAT 


480 


CTCTTTGATC 


TCTTTGAAAA 


TGACCCTGAG 


TTCAAGAGTT 


TCCACTTGGA 


TGGACAAACT 


540 


ATTGTCCTTG 


ACGACTAACT 


TACAAATTCG 


CCCTGAAAAT 


CGCGACAAGG 


TCCAACGCTA 


600 


CATTGACGAG 


GGCAAGCTTA 


AAATTGGTCC 


CTTTTACATC 


TTGCAGGATG 


ATTATTTGAT 


660 


CTCCAGTGAA 


GCCAATGTCC 


GCAATACCTT 


GATTGGTCAA 


CAAGAAGCTG 


CCAAATGGGG 


720 


TAAATCAACC 


CAGATTGGCT 


ACTTTCCAGA 


TACCTTTGGA 


AATATGGGAC 


AAGCGCCTCA 


780 


AATTCTTCAA 


AAATCAGGCA 


TTCACGTGGC 


GGCCTTTGGT 


CGTGGTGTGA 


AGCCGATTGG 


840 


ATTTGACAAC 


CAAGTCCTTG 


AAGATGAGCA 


GTTTACGTCT 


CAGTTTTCAG 


AAATGTACTG 


900 


GCAGGGTGTG 


GATGGTAGTC 


GTGTTTTAGG 


TATTCTCTTT 


GCCAACTGGT 


ACAGTAACGG 


960 


GAATGAAATT 


CCAGTTGACA 


AAGATGAGGC 


CTTGACCTTC 


TGGAAACAAA 


AATTGTCAGA 


1020 


TGTGCGTTGC 


CTACGCTTCG 


ACCAACCAAT 


GGTTGATGAT 


GAAACGGCTG 


TGAACACCAG 


1080 


CCTGTCCCAA 


AAAAAATCTG 


AGCGAAGCCA 


TTCCGTGTGG 


CAAATGAACT 


CTTCCCGGAT 


1140 


GTAATCTTTG 


TTCATAGTTC 


TTTTGATGAA 


TATGTTCAAG 






1180 



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 579 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:



CTCAAAAAGT CTGAAATTGG ATCATTTGGA ACCATCTGAG CAGTAAGTTT AACACCTTTT 60
TCTGCCAGTT TGTGGAAGTC TTCCACATCC TTGTCTACTA CGTTGATAGA ACGTGTAATA 120
GAACGAGTTT CTGGTGTTTG AGACATATTC CCAACATTAA GGGTTTCAAG TGGTACACCT 180
GCTTCTACCA AACCAAGGGA AGCGGTCTGG TTTACGAGCC ACGATAAAGA GACGTTGGCT 240
ATCGTATTTG CCAGCAAGAA TATTGGCTGC AGCTTTCTCA ACTGGCAAAA TACTCAATTT 300
CACACCTGGT GGTGTCGCAA GTTTCAAACC ACTCTTTTCA ATATCGTTGT TGACAACTTC 360
GTCGTCTACA ACCATAATGC GTGAAACATT TAGTTTTCCA GCCCAAAGAT TGGCTACTTG 420
TCCGTGGATC AAACGTCCAT CAATACGGCA TCCTACAATT GTCATAAGTT TTCCCCCTTT 480
ATATGTTTTA GTGTAGGTTT ACGAGTTAAA TGAATCTCTT CTTTATATTG ACCTTCTGTT 540
TCAAAGATAA TGATGCGGTT GGCACCTTCC TTGAGATAG 579



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 736 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



CGCATANCCT AAAAGTTTTC TTTGACATCT TCTGGAATAT CTGTATCATT AGCTTCCTGA 60
ACATCTCTAC CATAAATCCC AAAGGCTGGA ATCCCTTTTT GAGCATGTGA AGCTAGTACA 120
GCTGCAAGAT AGACAGCTCC TGGGCGTTCT GTCCCATTAA ATCCCCAAAT AGCATGAGGA 180
ATATCTGGAG ACATATCCAT AGTTTCACTA CCATAACACC AGCATGGTGT AACTGTAATT 240
GTTGCGCAAA CATTTGATTT TTTAAACAAC TCATGGGAAG CTGCAGCCTC TGGAACACGA 300
CCAATGGTAG ATGGAGAAAT CACACATTCC ACAGGTTCCC CATCTGGATA TTTCAATGTG 360
CTTGAAATCA AATCTGCCAC ACTTTTAGCC ATGTTCATTG TTTGTACTTC AAGTGATTCG 420
CGTACACCTT GACGACGACC ATCAATAGTC GGACGAATCC CAATACGTGG ATGTTGAATC 480
ATACTATTTT CCTCTTATTT TTCTGTTTCT TTCATACTAC CATCGCCATT ATATTTACGG 540
TAGCCTGGGT GACGACCTGT AACCTTATAA TTTTCTCGCA TTGAGAATAG ACGTTCTAAA 600
GTCGGACGAG CAATTTCTTG CTCCTCAATG CCCTTTGTAG AAAGTAACAT TCTTCCGTGG 660
AAGGTTGTCT TTGCGACTAA TTCTAAAGTT TCCATACGGT AGTATGCTGT AATTACATCG 720
CTTCCGACAG TCAGAG 736



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 590 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

CTCATTGTAT TTAGGAGAAA TGATGGTATC TTCCAAATCA AAATCAACTT CACTCCATAG 60
TCTCAACGGA TTGATTTTCC CATCTTGATA GGTCACATCC TTGTCAAGGA TAAACTGAGT 120
CAACGCCTCA TGCTGACCTT GACACCTGAT GTCATCTACC AAGAGCCAGA CATCCTCTAC 180
CAACATGAGG ATTTTTCTCT TGTGAAGATA AGGCAAATCA GGTTCTGCTG ACCAATAAGC 240
CCCTTCAATA TAATGCATTC CCTCCCTTTC TTTATGGTGA CAAAACAGGG AGTGAGGATA 300
GTATTCATAT TCCCAGGATC CCGTGATTCT TTCCGGAGCT TTCCCATCTA CAATGCAGGT 360
CGAATGACTC CAAGCACTCT TTAAGAGATA ACGTTCATAT ATCTCCCGAT AAGAATAACG 420
CCCAGCATCT ATGAAAATAG GTTGGCCTTG ATACTGTAAG CAAAAACTAT TCTCGTCACT 480
ATGGCTATGG GCACTTCCTA GCGGACCATT TTTGAAAAAT AGATAACGAT GTTCATCCTT 540
AATGCAGACA TGTCCAGAGT CTTCAAAGAT CATGGACTTA GGCTGCCAAG 590



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

CTAGTGTAGC TTCCTTAATC TTATCTGATA AGATAGCTGT CATATCAGAC TCAATCATTT 60
CCTGGAGCAA TCACATTGAC TCGTATATTC CGACTAGCGA CCTCGCGTGC CACAGACTTG 120
GTAAAGCCAA TCAAGCCAGC CTTAGAAGCA GCATAGTTAG CTTGACCAAT ATTCCCCATC 180
AAACCAACAA CACTAGACAT ATTAATGATA GCACCTTCTC TGGCTTTCAA TCATCGGTTT 240
CCAAAGACTG ATTGTGTCAT ATTAAAGGCA CCAGTCAGAT TGACCTTGAG CACTTTTTCA 300
AAATCTGCTT CTGTCATCTT GAGCATAAGA GTATCTTGGG TAATCCCTGC ATTGTTGACC 360
AAAACATCTA CTGAACCCAG TTCTGCAATA GCTTGATCAA TCATACGCTT AGCGTCTGCA 420
AAATCTGATA CATCTCCTGA AATGGGAACC ACCTTGATAC CATAGTTTGA AAACTCAGCG 480
AGCAATTCTT CTGAGATTGC CCCACGACTG TTTAAGACAA TGTTGGCTCC TGCTTGAGCA 540
AACTTGTGGG CGATGGCAAG ACCAATTCCA CGACTCGAAC CTGTAATAAA GATATTTTTA 600
TGTTCTAGTT TCATTTTTTT CCTTTCAAAA CTTCTACTTA TTTTAGTCTA TTTTTCTAAA 660
AGTGCTACTA AACTCGCTTG ATCTTCCACA TGAGCTAAGT GAGCAGTTTG ATCAATTTTT 720
TTAACAAAAC CTGACAAGAC TTTCCCCGGT CCAATCTCGA ATAAAGTTGC TTATGCCTGC 780
TTCTTGCATG ACCCCAATAC TTTCATAGAA ACGAACGGGT TCCTTGACCT GACGCGTCAA 840
GAGCTGAGCA ATGTCCTCTT TTTGCATCAC AGCAGCTTCT GTATTGCCGA CTAGGGGACA 900
AGTAAAATCT GAAAAACTTA CCTGAGCTAG AGTTTCAGCT AGTTTCTGGC TAGCAGGCTC 960
AAGGAGAGCG GTGTGAAAGG GACCTGACAC CTTAAGAGGA ATCAAGCGTT TGGCACCTGC 1020
TTCTTGCAAA AGTTCAACCG CTCGATCAAC TGCAACCACT TCTCCAGCAA TGACGATTTG 1080
TGCAGGTGTG TTATAGTTGG CTGGAGTAAC CACTCCAAGT TCCAGAAGCT TTTTGACAGG 1140
CTTCTTCAAT GACCTCTACT GGCGTATTGA GAACTGCTAC CATCTTGCCA AGTTCAGCA 1199



(2) INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1414 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 



CTACTATTTT CTTTGCCATA GCCTTCTCCT TTACACACTA CGCATATCGT GGTAAGAAAC 60
ACTGCGTCCC ATCTCACCTG CATTCTCTTT TTGAACAAAG GTATTAGCGT TTATATAGGC 120
AATAGCAGAA GCCTTCAACA CATCAAAATC AAGCCCTGCT GCATTAAAGA TGGTTTCTGT 180
ATCCCTGTTT TCAACAGTGA CCAACAACCC GATCCTGGGC ATCGATTCCA TCTGTTACCG 240
CATTGATAGT GTAGGACACC AAACGAACAG ATTGGTTAAA GAACTTATCG ATAGCGTTAA 300
AGATTGCTTC AACGGAACCC TGTCCCTGTC GCATTAAATT CGACTTTCTC ACCATCCATA 360
TTGGCTAGGC TAACGAGCGC TTCAATGTCA TTATCTGCAT GAGTTTGAAG TTGTAAATCA 420
TCAAAGTGGA AGCCTTCTGG ATTTTCAACC ATGGTTCCAG CTACCAAAGC TCGAGTATCT 480
GCATCTGTGA TTTCTTACTT CTTATCGGCC AGTGCCTTGA ACTTAGCAAA GAATGGTTTG 540
ATATCCTCTT CTGTAAAATC TAAGGCCAAT TCTCTCAGTT TCTCAACAAA AGCATGGCGA 600
CCAGATAATT TTCCAAGCGG AATCTTAACA CCAACCAATT CAGGTGTGAT GATCTCATAA 660
GTGAGAGGAT TTTTAAGGAC TCCATCTTGG TGAATACCAG ATTCGTGGGA GAAGGTATTG 720
CCACCAACGA CGGCTTTGTT TTTAGGAACT GGAATACCAG AGAAGCGAGA AACCATTTCT 780
GACGTATTGA TGGTCTCATT TAGGACAATA CTGGTTTCTA CTTGGTAGTA ATCTTGGCGA 840
ATATTGAGAG CCACTGCAAT TTCTTCCAAA GCAGCATTTT CAGCTCGCTC CCTAATACCA 900
TGGATAGTCT CTTCAAAAAG TCCTGCACCA TTCTTGACAG CAGCAAGGCT ATTTGCCACT 960
GCCATCCGAA GTTCATCATG ACAGGGAGGC GAATAGATGA TCTGACGATC CGTCTGGACA 1020
TTCTCAATCA GGTATTGGAA GATGGCACCA CATTCCTCTG GTGTGGTAAA TCCTATATTT 1080
TCTGAAAATT TCTTCAGTAA AGAATATTTA GCTAATTGAA AGTTCATGAA AATTATTAAA 1140
ATATTTCATT TTTTAGAGGT TAAGTTCCAA CTTTTTTCTA TCAATTCCAG TACTTNTTCA 1200
TCTGATAAAG TATCATCAAG GGACACACTA ATCCAGTAGC GCTTGCTCAT ATGGAAGGCT 1260
GGATAAATCC CCTTTTGTGA AAGCAAATTA GCTACTTGGT CATGCTTGAG GTTGACTGCT 1320
TCCACTTGTC CTTCTCTGCC CTTTTCCAGC TTATTCCAAG AGATTTTCAN CAAGACGGCA 1380
TACCACTTTT GATTGCCTTC ATGGCGCAAT ACAG 1414

(2) INFORMATION FOR SEQ ID NO: 80:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 750 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



CTCCCCATTT TGGAAAATTT CTGTCAAGAA ACGGCGAACC AGCTTTTTAT TTTCTGCTTT 60
CTTGTCCAAA TCCTTGATTT CAAAATCTCC AAAAATTTGA TCTAGTTGGT CATTTTCAGG 120
TGTTCGATAG TAGTCAATGA CATCCCAATG CTCAACAATA CGACCATTCT CATCCGCACG 180
GAAAGTATCC GTCGTCACCC ATTGAGCTTC TCCACCATTC AGATATTGAT GGAACATGAA 240
CAAAGACCAG ATTGCCATCC TCAATGGTGC GGACAATCTT AATCTGACGC TCTGGATGAC 300
GCTCAAAGAA ATCTGCAAAG AAGGCTGCAA ATCCTTCTTT CCCGTCAGGA ACACCTGTCG 360
AATGTTGGAT ATAGGTATCC CCTACAGACT GGGCTTGAGC CTCAGCAACT CGTCCGTCTT 420
GAATGGCATG GATGTATAGG TTGTGAGCAT TTTTCACTTG TTGTGACATA TTCTAAACCT 480
CATTTCCCTT CTCTTTCAGA TTCGCCAAAA TTCTTTCTTG AAAACCTTCA AATTGGTGAA 540
TTTCTTCCTC TGAAAATCCT TTGTAAAAGA TAGTATCCAA TTTCTGACTG ACACGATGCC 600
CCACTTCTTT CTGGGACTTG CCTAACTCCG TTAAAACTAA ATACTTCTTA CGCTTGTCTT 660
TTCCACACGG ACTAACAATT ACAAGCTTTT GTTCCTCTAG CTTTTTTATC ATAGTCGTCA 720
GCGTATTATT CGCAAGCCCA GTCGCAAGCG 750



(2) INFORMATION FOR SEQ ID NO: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 953 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 



GTATCGTTCA ATTCCAAAGA TACGGAGGAA GCTGCGTTTC GCGAGAAAAC TCTTGTTAGA 60
CCAGATAGGG ATTAGCCAAA ATCCTTCTAT GNGCTTATAT CCAAATGGAG ATGCAAACTT 120
CTGCCCAACT CTTGACCAAT AAGATCTTTC TTAAAAATCC ACTCAAGGCT CTGGTAGAGG 180
AAAAATATGG GATTGAGTAT GAAGAATTTA CCAATCCTTG GCACGCTGCC ATTTCTAGCT 240
TCGTTGCCTT TTTCCTTAGA AGTTTGCCTC CAATGCTGTC AGTGACCATA TTCCCAAGTG 300
AATATCGCAT CCCTGCTACT GTCCTTATTG TCGGTGTGGC CCTTCTTCTC ACTGGTTACA 360
CTAGTGCCAG ACTTGGAAAG GATCCGACTA GAACAGCTAT GATTCGGAAC CTTGCTATTG 420
GTCTCTTGAC CATGGGAGTT ACCTTCCTGC TCGAACAACT TTTCAGCATT TAGAATACAA 480
GAAATACCTC GATTTTGAAG TCGAGGTATC TTTTTTACAT TTGCACAATC TTGCGATAAC 540
TTCTTGAAGT AATCATGAAA ATCAGCACAT AGGCGATGAG GAAGATAGCG CAGATAGACA 600
AGGTCACAAT CAACATCATA GTCGTATCCA GTACACCAAT CACTTTTAAA ATCAGGCTAA 660
GCATATGGTA GGCAAAGGCG AGATGTATGA AGGCAAAGAG CAAAGGAAGG AAGAAAACAG 720
TTAAAACCTG TTTGTTGATG GTTTGCTTGA TTTGCTTTTG GTCCAAACCG ACTTTCTGCA 780
AGATAATAAA GCGTTCACGG TCTTCGTAGC CTTCAGAAAT TTGTTTGTAG TAGATGACCA 840
GAACAGTTCC GACCATAAAG ATAATGGATA GGAAAATACC GATAAAGAAG ACACCGCCAA 900
AGAGGACACT CATTTGAGCA CTAGCATCTG CTAGATTGCT ACCATAAACA TAG 953



(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:



CTGATTGAAC TTCTATTTAC TAATATTCAA AAATCCTCCG TTTCAAAGAG CAGGGAACTC 60
TTTGTGACAG AGGATTTTTT CTATAGGGCT TTAGCAGCTG CAATTGCGGC TTCGAAGTTT 120
GGCTCAGAAT TGATATTATC CACGTATTCA ACGTAGCGAA TCGTATTGTC AGTATCGAGG 180
ACAAAGACTG CGCGTGCTAA TAGGTGCCAT TCGTTGATCA AGAGGGCATA ATCGCGCCCG 240
AAAGAATGGT CAAAGTAGTC TGAAAGCATA ATGGCATTGT CAAGGCCTTC AGCACCGCAC 300
CAACGTTTTT GAGCAAAAGG TAGGTCCATT GAAACAGTCA ATACGACCGT GTTGTCCAGT 360
CCAGCCAATT CTTCATTAAA ACGACGTGTT TGAGTTGAGC AGATGCCTGT ATCGATAGAA 420
GGAACGACAC TCAAGACTTT TTTCTTGCCA TCAAAATCAG CCAGAGATTT TTTAGAAAGA 480
TCTGTTGTAG TAAGAGAAAA ATCAAGCGCC TTGTCGCCGA CTTGTAGTTG TTTACCTGTA 540
AAGCTCACAG GATTTCCGAG AAAAGTTACC ATAGGATACT CCAATCTTTT TTCTTCCATT 600
GTATCTGAAA CAGTCAGAAT TTTCCAATGA TTTGACCGGA AATGTGGGCA TAGAAAAAAC 660
GCCAG 665



(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1039 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:






i AAu 1 A 1 AA 


AAAACTTGCC 


GAAAGTATCT 


CTGAAGTTAA 


TTTGAATGTC 


TTTTTTCAAA 


60 


Al (jt- 111 ALf A 


TTTTGTTGAA 


ATCTTTAATC 


AAGAGCCTTG 


TCTGATAAAT 


TTTTTTGAGT 


120 


1 lAIxiAAl It. 


TATGTTACAA 


GGCGATATTC 


CAAAGTTAAC 


AATAAACCAA 


GAATAATAAG 


180 


AGGbCjAACAo 


TATGAAAGAT 


ACGATTTCTA 


ATAAGGATTT 


GATTTCCATG 


GGCTATAGAC 


240 


CATCAACAGC 


AAATGCTATT 


ATCCATCAGG 


TGAGAGAATT 


ACTTGTATCA 


CGAGGCTATA 


300 


CATTTTATAA 


TCGCAAACGT 


TTGATGGTTG 


TTCCAAAAAG 


TGTTGTGAAA 


GAGTTGTTGG 


360 


GAATGGAATT 


GTGAAATGGC 


TTATATCGAG 


TATAAACAGC 


GTGGGAAGAA 


AAGGCTTTGG 


420 


TCGTTTTCTA 


TACGTGAGAG 


GAGCAAGAGC 


CTACTCCATA 


AAAGCGGATT 


TAAAACAAAA 


480 


CGAGAAGCTA 


AAATAGAAGC 


GGAGAAAGTT 


CTTCATAAGT 


TAAATACTGG 


GAGTGTCTTA 


540 


AGCTCTAGTA 


TGACTTTATC 


GGAGCTTTAT 


AATGAATGGC 


TGGATTTAAA 


AATTTTACCT 


600 


AGCAATAGAA 


GTGTAGTTAC 


TAAAAAAAAA 


TATCTTATGA 


GAAAGAAGGT 


CATCGAAAGG 


660 


TTATTTGGAA 


ATAAGCCTGT 


ATCACAAATT AAGCCTAGCG 


AGTATCAAAA 


AATTATGAAT 


720 


GAGTATGGAG 


AGACTGTATC 


GAGAAATTTC 


TTAGGAAGAT 


TGAATTCTAG 


TATCCAGGCA 


780 


AGTATACAGA 


TGGCTATTGC 


TGATAAGGTG 


ATAATAGAGG 


ATTTTACTGC 


TTATGTTGAG 


840 


TTGTTCTCTT 


CTAAGAGTGG 


ACACAAAGGT 


TGAGGAAAAG 


TATCTACATA 


CTGAGTCAGA 


900 


TTATCAAAAA 


GTTTTAGTAT 


ATTTGAAAAA 


TAAGTTTGAT 


TATCAGAAAT 


CTATTGTACC 


960 


GTATGTAATT 


TATTTCCTTT 


TTAAAACTGG 


CATGCGTTTC 


TCCGAATTGA 


TTGCTCTAAC 


1020 


TTGGGATGAA 


GTTGACGAG 










1039 



(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 



CTGAGAAGTC AGGAAGGCGA TGTGGAGAGA TTGAGGAATT GGATACTTAA CTCCATAGTA 60
ATCTTCGTAA AACTCGATAG AGCGAACAGC GATATCCAGT GAGAAATCAA GATTTGAAAG 120
TGGATGTGCT TTGGTTGAGT AGACACCCTA CCAGGGTACC ATTTTTAGTT TTAGCGGTCA 180
CCCCTTGCAA ATCACCAGCA ACAAAGGCCA ACAAGTAAGA AGACATGCGA GGTGTTGTCT 240
CAAACTTCCA GATACCTGTT TCCTTACGGT TTTCAACATC GATTTCTGGC ATGTTTGACA 300
AGGCCAATTC ACCTTCTGCT TGGTCAAAGC GAAGAGAGAG GTCAAAAGTT GCTTTGGCTT 360
CAGGCTCATC CACACATGGG AAAGCTTCGC GCGCAAAATG GCTCTCGAAC TGAGTAGACA 420
AGACCTCCTT CTTGACTCCA TCAACTGTAT AATAAGAAGG GTAAATCCCT GTCATGTTGT 480
CTGTAATTTT ACCAGAAAAG GCAAGAACCA ATTCAACTTG ACCAGCCTCA GCCAATTCGA 540
TATGAAGGGC TTCATTGTCA TGGTCAACTG TAAATGGACG AGCTTGACCT GCAACTTCTA 600
CAGAGGTGAT TTCCAAGTCT TTTTGGTGGA GGGAGATGCG GTCACTCTGT GCTTGACCAG 660
TGATGGTCAC TTTCCCAGAA AAAGTCTTGG TCTCACGACT CAAATCTAAA AATAAATCAT 720
AATGTTCAGG AACAAATTGC TTAATAAAAT GTTCAACTGC TTGCATAGTT TTCTCCTATT 780
CTAAGTTTAA GAG 793



(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:



CGGATGATTT GCTTTTCATC TATGATAAAG TCTTTGTTGA GCGTCGGAAT CTCTACTGAC 60
TGGAAATTTC CCGTAGATAA TCCAAATGCC CTTTAAAGAA AACCTCATCT GTCAACACCG 120
AAATCATCAC TGCTCCGTTT TCTTCATAAG TCTGGGCCTG TTGCACAATA TCCACATCGA 180
GATTGATATC TCCCCAAAAC TAGGGCTAGC TTTCTTGACT TCAGCGATTA CCTGCAAGCG 240
GTCCTGATGA TTCTTCAAAA ATTCTGCCAA GCGATAGGTC TGGCGCAGAG GCTGGATTTG 300
CTCCAGCTTC ATCTGCTCCA CCTCACGCGC CTTCTGCTCT AAGATTCGTG CTAAAAATTC 360
CTGACTCATT TTTGGTACTC CTGTAACAGT CTGAGTTTTT CAAGGGCCTT GCCTCTAGCA 420
ATCACTTGAC GGGCCAAGGC AACTCCTTCC TTGATGCTAT CAATCTTACC ATTAGCATAG 480
AAACCAAGAC CAGCATTCAA GACTGTCGTT TCCAAGAATG GACTTGCTTC GTTTTTCAGA 540
ACGCTAAGCA AAATTTCTGC ATTTTCCTGA GCATTCCCAC CACGAATATC TTCCATAGCA 600
TAGCCTTCCA TTCCCAAATC CTCTGGAGTA AAGCTTGACA AG 642



(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:



CTCTTGCAGG TTATTAAGGA GAAAACGGAG GTAATAGAAG TATGATTTAT ACAGTCACAC 60
TCAATCCATC CATTGACTAT ATCGTTCGTT TGGACCAAGT CAAAGTTGGT AGTGTAAATC 120
GTATGGACAG TGATGATAAG TTTGCTGGTG GGAAAGGAAT CAATGTCAGC CGTGTCTTGA 180
AACGTTTGAA TATATCAAAT ACAGCGACGG GATTTATCGG TGGCTTTACT GGTAAATTTA 240
NCNCAGATAC TTTAGCAGAG GAAGAAATCG AGACNCGTTT TGTCCAGGTG GCAGAAGATA 300
CTCGTATCAA TGTTAAAATC NAAGCAGACC AAGAAACAGA AATCAACGGA ACGGGTCCAA 360
CTGTTGAACC GGTTAAGCTA GAAGAATTGA AAGCTATTTT ATCTAGTCTG ACAGCAGAAG 420
ATACAGTTGT CTTTGCAGGT TCAAGTGCTA AAAATCTAGG CAATGTTATC TATAAGGGAT 480
TTAATCTCCT TGACGCGCCA GACTGGTGCG CAAGTGGTCT GTGACTTTGA AGGACAGACC 540
TTAATTGATA GTTTGGATTA CCACCCTCTT CTTGTAAAAC CAAATAATCA TGAACTTGGA 600
GCGATTTTTG GGGTTAAACT CGAAAGTTTA GATGAAATTG AGAAATACGC TCGTGAGTTA 660
CTGGCTAAGG GTGCTCAAAA TGTTATTATC TCTATGGCTG GTGATGGTGC CCTTCTTGTC 720
ACATCTGAGG GAG 733



(2) INFORMATION FOR SEQ ID NO: 87: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

CTAAGGGAAG TAGGAAAAGT ATGTATCCAG ATGATAGTTT GACATTGCAA CACGGACTTG 60
TACCAGATCA ACATGATGCA GGTTTACTTT GATCAAGGGA TTTACAATAA GAAGGCGGTC 120
TTTGAGGTGT ATTTCCGCCA ACAGCCTTTT AAGAACGGCT ATGCGGTTTT TGCAGGTTTG 180
GAAAGAATTG TGAACTATCT TGAAGACTTG CGTTTTTCAG ATAGTGATAT AGCCTATTTG 240
GAGTCGCTTG GTTATCATGG GGCGTTCTTG GATTACCTTC GCAATTTCAA GTTGGAGTTG 300
ACCGTTCGTT CTGCCCAAGA AGGGGATTTG GTTTTTGCTA ATGAACCGAT TGTCCAGGTG 360
GAAGGACCTC TAGCCCAATG TCAGTTGGTC GAAACGGCTC TTTTGAACAT CGTCAACTAC 420
CAGACCTTGG TGGCGACGAA GGCAGCCCCC TATCCGTTTG GTTATCGAAA ATGAACCCTT 480
GATGGAGTTT GGGACACGTC GGGCTCAAGA AATGGATGCG GCCATCTCGG AACACG 536

(2) INFORMATION FOR SEQ ID NO: 88:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:



CTGCCCCTGT AAGGCTGGAC GATTGCCTTT CTTAGTATCC GCAAAGAGGT AAACTGAGAA 60
TAGAGAGGAT TTCTCCTTCA ATATCTTTGA CAGACAGGTT CATCTTGCCT TCTACGTCTG 120
AAAAAATCCG CATATTGACC AGTTTTCTCA CAGCATAGTC CAAATCTTCC TCTTGGTCCT 180
CTGGTCCAAC ACCAACCAGC AATAAAAGTC CCTGATTGAT TTTTCCCTGA ATCTGGCCTT 240
CTATACTCAC TTGGGCTTTT TTAACCCGTT GGATAATGAT TTTCATAATA GCCTTTCTAG 300
TAAGAGCTAG GACAACTAGC CGTTGGTCCG TTTGACAGAG TAAACTTCTG GCACACTCTT 360
AATTTTATCG ACAACCGTGG TCAGTGTAGA GAGGTTGGCA ATACCGAAGG ACACATGGAT 420
ATTAGCAAAC TTCATATCCT TGGTTGGTTG GGCATTGACC GTTGAAATAT TCTTGGTTGT 480
ATTTGAAAGA ACTTGCAGTA CATCGTTCAA CAGTCCTGTA CGGTTGAGAC CGTAGATATC 540
GATATGGGCC ATATACTCCT TATTTGAG 568



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

CTTTATATTG TACCAAGTAT CATTTCTAGT GCTGCTATGG GCTTGATTTT TCTTCAAATC 60
TATAATCCAA ACTATGGTGT TGTTAACCAA ATTATTCATC TATTTAATCC ATCGTTTAAA 120
GATTCAGTAC TGTTGACTCC CAGGATTAAA AATAGTAGCT ATGACTGGCG CTTATATCTT 180
CTTTGCAGGA GCATCAACCA TTATGATTTC TTGGGCAAAT TTTTGCTATT CCAGAAGAAG 240
TTCAAGAAGC TGCTATTTTA GACAATATTA CTGGTTGGAG GAAAGAGTGG TATATTACGA 300
TTCCGATGAT TAAGGGGACA ATTAAAACTG TTTCAATTAT GGCAGCAACT TCAGGATTTT 360
TGCTCTATAA CGAAGTATTC TTTTTGACAA ATGGTGCTGC AGGAACAAAA AGTATCAGTT 420
TTGTTATTCG AGAATTAGCA GTGGCTAGCT CACGAACTCA GTATGCTCGT GCAAATACAA 480
TTGGAGTTAT ACAAATCTTA GGTGGAATGT TGATTATCGT TTGTATTAAT ATTTTATTCA 540
GAGAAAGAAA AAGACTGAAA GGTGGGAAAT GATTATGAAT ACACATATAA ATGGTATTAG 600
TAAAAAAGGC AAAGTTCTTA TATATGGTTA TATGCTCCTT ACCATTTTAA TTTCTATTTT 660
CCCTATTGCG TGGATTTTTT TATCATCATT AAAAGCAGAT CCTATGAAAA ATCCAGGTAT 720
TAGTTTACCG ACTGACTTTA CTCTTGAAGG TTATATAAAT GTTTTTACAA AACTTCATGT 780
TTTTACTTAC TTTTGGAATA 800



(2) INFORMATION FOR SEQ ID NO: 90: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 686 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:






CTCAGTATTA TCCCTACCAA AATGAACGTC AGTTAGTGAC TCAGGTGGTT TGGGAACAAT 60
GGGCTTTGGA ATTCCAGCAG CAATCGGTGC TAAAATTGCT AACCCAGATA AGGAAGTAGT 120
CTTGTTTGTT GGGGATGGTG GTTTCCAAAT GACCAACCAG GAGTTGGCTA TTTTGAATAT 180
TTACAAGGTG CCAATCAAGG TGGTTATGCT GAACAATCAT TCACTTGGAA TGGTTCGCCA 240
GTGGCAGGAA TCCTTCTATG AAGGCAGAAC ATCAGAGTCG GTCTTTGATA CCCTTCCTGA 300
TTTCCAATTG ATGGCGCAGG CTTATGGTAT TAAAAACTAT AAGTTTGACA ATCCTGAGAC 360
CTTGGCTCAA GACCTTGAAG CTACTACTGA GGATGTTCCT ATGCTAATTG AGGTAGATAT 420
TTCTCGTAAG GAACAGGTGT TACCAATGGT ACCGGCTGGT AAGAGTAATC ATGAGATGTT 480
GGGGGTGAAG TTCCATGCGT AGAATGTTAA CAGCAAAACT ACAAAATCGT TCAGGAGTAC 540
TCAATCGCTT TACAGGTGTC CTATCTCGTC GTCAGGTTAA TATTGAAAGC ATCTCTGTTG 600
GAGCAACAGA AGATCCGAAT GTATCGCGTA TCACTATTAT CATTGATGTT GCTTCTCATG 660
ATGAAGTGGA GCAAATCATC AAACAG 686



(2) INFORMATION FOR SEQ ID NO: 91: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 802 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 



TTTTCCCAGC TATTTTACTG AGTATAAAAG CCTTATTTAA CTTACTCTAT GTACTCGGTT 60 

TTCTAGGAGG AATGTTGGGA GTTGGGATTG CTTTGGGGTA CGGAGTGGCC TTATTTGACA 120 

AGGTTCGGGT GCCTCAAACA GAAGAATTGG TGAATCAGGT CAAGGACATC TCTTCTATTT 180 

CACAGATTAC CTATTCGGAC GGGACGGTGA TTGCTTCCAT AGAGAGTGAT TTGTTGCGCA 240 

CTTCTATCTC ATCTGAGCAA ATTTCGGAAA ATCTGAAGAA GGCTATCATT GCGACAGAAG 300 

ATGAACACTT TAAAGAACAT AAGGGTGTAC TACCCAAGGC GGTGATTCGT GCGACCTTGG 360 

GGAAATTTGT AGGTTTGGGT TCCTCTAGTG GGGGTTCAAC CTTGACCCAG CAACTAATTA 420 

AACAGCAGGT GGTTGGGGAT GCGCCGACCT TGGCTCGTAA GGCGGCAGAA ATTGTGGATG 480 

CTCTTGCCTT GGAACGCGCC ATGAATAAAG ATGAGATTTT AACGACCTAT CTCAATGTGG 540 

CTCCCTTTGG ATCGAAATAA TAAGGGACAG AATATTGCAG GGGCTCGGCA AGCAGCTGAG 600

GGAATTTTCG GTGTAGATGC CAGTCAGTTG ACTGTTCCTC AAGCAGCATT TTTAGCAGGA 660

CTTCCACAGA GTCCCATTAC TTACTCTCCT TATGAAAATA CTGGGGAATT GAAGAATTGA 720 

TGAAGACCTA GAAATTGGCT TAAGACGGGC TAAGGCATTC TTTACAGTAT GTATCCTACA 780 

GGTGCATTAA GCAAAGACAA TT 802 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 545 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:



CTACTATTTT AGCATAAAAA TGCCCAAAGG GGGNGCCGTG TGTTTACTGA TTTTCAGGNT 60
AATGGACCAG GNAAATCAGC ATGAAAATAA AAAGAGAAAC AGATTATTTT AGCCATTTGT 120
CAGATTTATG CTATGCTTAA GGTAGAAAAT GAAAGGGATA ACAAATGTAT TTAGGAGATT 180
TGATGGAGAA AGCCGAGTGT GGTCAATTTT CAATCCTTTC CTTTCTATTA CAAGAGTCTC 240
AGACGACCGT CAAGGCTGTA ATGGAAGAAA CAGGATTTTC AAAAGCAACC CTAACCAAAT 300
ATGTCACCCT GCTCAATGAC AAGGCTTTGG ATAGTGGCTT AGAACTGACT ATTCACTCAG 360
AAGATGAAAA TCTGCGTCTG TCTATAGGTG CAGCTACTAA GGGGAGAGAT ATTCCGGAGC 420
TTGTTTTTTG GATAGTGCTG TTAAATACCA GATTTTGGTT TATCTTCTCT ACCACCAACA 480
GTTTTTAGCC CATCAGCTGG CTCAAGAATT GGTGATTAGC GAGGCTACGC TTGGTCGTCA 540
CTTAG 545



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 576 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

GAGGTCAAGA TTGATTCCCA AGGTTNGAGG AATCTGCGGA ACCAGATTTA GAAGAAATCG 60 

AGCGGTTACC ATGCTTGGCC ATGTGAATAC CGCCACCAGC CAAGACAAAG GCTGCAGTTG 120 

TGGAAATATT AAAACTGAAA GACTTGTCCC CACCTGTACC ACAGTTGTCC ATAGCATCAT 180 

GAATCTCAGT TGGAATATGC TGGGCATGTC CTCTCATGAC TTGGGCAATG GCTGTGCGTT 240

CTTCAGGTGT TTCCCCCTTC ATCTTAAGAG CTAAGAGGAG AGAAGCAATC TGCGCTTCAG 300

TTACACGCCC AGTTACGATA CGCTCAATGA CATCCGTCAT TTCCACACCT GATAAATTTT 360 

CAAATTTTGC TAGTTTTTCA ATAATCTCTT TCATCCTAGT TTCCTCACTT TACAACCTCC 420 

TCGATAAAAT TCCGAATAGA AGACAAGCCG TCTGGCGTTC CAATGCTCTC TGGATGGTAC 480 

TGGAAGCCAT AAATCGGTAG GTTTTTATGT TGAATCCCCA TGATGGCTTG GTCATCAGTC 540 

GAACGAGATC AAGCTTATCG ATACCGTNGA CCTCGA 576 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

ATTAAAGGGA GGCGAACATG GCCCAAGATA TAAAAAATGG AGAAGTAGAA GAAGTTCAAG 60
AAGAGGAAGT TGTGGAAACA GCTGAAGAAA CAACTCCTGA AAAGTCTGAG TTGGACTTGG 120
CAAATGAACG TGCAGATGAG TTCGAAAACA AATATTCTTC GCGCTCATGC AGAAATGCAA 180
AATATCCAAC GCCGTGCCAA TGAAGAACGT CAAAACTTGC AACGTTATCG TAGCCAGGAC 240
TTGGCAAAAG CAATCTTACC ATCTCTTGAC AACCTTGAGC GTGCACTTGC AGTTGAAGGT 300
TTGACAGATG ATGTGAAGAA GGGCTTGGGG ATGGTGCAAG AAAGCTTGAT TCACGCTTTG 360
AAAGAAGAAG GAATTGAAGA AATCGCAGCA GATGGCGAAT TTGACCATAA CTACCATATG 420
GCCATCCAAA CTCTCCCAGC AGACGATGAC CACCCAGTAG ATACCATCGC CCAAGTCTTT 480
CAAAAAGGCT ACAAACTCCA TGACCGCATC CTACGCCCAG CAATGGTAGT GGTGTATAAC 540
TAAGATACAA AGCCCGTAAA AAGCTCGCAG TAAAAATAGG AGATTGACGA AGTGTTCGAT 600
GAACACAAGA AAATCTANCT TTTTTACTCA GAGCTTAGGG CGTGTTCGAT TCGGCAATTC 660
TGACGGTAG 669

(2) INFORMATION FOR SEQ ID NO: 95:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 615 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

CTAGTTCACC


ATTTCTATTT 


AAA I Iv-vjiOv- 1 




PAA ATATPAT 


ATAAGGTAAA 


60 


GTTCCCACTA AGAATCTCAG 


J\ H TV* 7\ 7A 7i mmp 

AniuAnAl 1 v_ 






AAPTTAGCTT 


120 


ATLLAA1 I i\- 


a f2 appc: a t a a 


ACCC A I At- 1 1 


par' T\f a hp ar"p 


TPPTPTTT2 A A 


ATTTAGAGTA 


180 


ACTCTTATTA 


GACATTTGAT 




r^ncccv a tt & 


HI A Af2f2 A fXPPP 


H ATTTAAAAC 


240 


TTCTTTATTC 


ATGAAATTTC 




A Tip A A A AC AT 


TPTATATAPT 


TAAATTTTAf; 


300 


GGCAAGACTG 


CCGTTCCACT 


GACCACGCTC 


/— » » rT>-rrV'« /~< A ^ 

C AC TTGC CAG 


CAAACjL. I\j\jV- 




360


TTACCCTAAA TATAAATTAT TTAATTTTTG CCATTTCAAT CAATTCTGCA AAGGTCCCAA 420
GCTTTCTGGA TTGAATTACC TCATATTGGG CATTCTGTCC AAATGAAGCA TTATTAATCA 480
CTTCAACCTC ATAAAGGCCT GGAGCAGAGA AAACAGCTCC AAATTTGGAT TTATCGAAAT 540
GCATACCTAG GTCACTCCAA CAATCATTAA GGGTGGAGAC TTTTATCCAG CCTCGTTCTT 600
TCTTAGTAGA AATAG 615



(2) INFORMATION FOR SEQ ID NO: 96: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 773 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

CTACTGTAGT TGGAACTGTT GAAAAACAAG GAAAACAAAA GAAAGTTGTT ACTTACAAGT 60
ACAAACCTAA AAAAGGTAGC CACCGTAAAC AAGGTCACCG TCAACCATAT ACAAAAGTTG 120
TCATCAACGC AATTAACGCT TAATTTTAAG GAGAACACAT GATACAAGCA GTCTTTGAGA 180
GAGCCGAAGA TGGCGAGCTG AGGAGTGCGG AAATTACTGG ACACGCCGAG AGTGGCGAAT 240
ACGGCTTAGA TGTCGTGTGT GCATCGGTTT CTACGCTTGC CATTAACTTT ATCAATTCTA 300
TTGAGAAATT TGCAGGCTAT GAACCAATCC TAGAATTAAA CGAAGATGAA GGTGGCTATC 360
TGATGGTTGA AATACCAAAA GATCTTCCTT CACACCAGAG AGAAATGACC CAGTTATTCT 420
TTGAATCATT TTTCTTAGGT ATGGCAAACT TATCGGAGAA CTCTTCTGAG TTCGTCCAAA 480
CCAGAGTTAT CACAGAAAAC TAACACGGAG GAAAACATTA TGTTAAAAAT GACTCTTAAC 540
AACTTGCAAC TTTTCGCCCA CAAAAAAGGT GGAGGTTCTA CATCAAACGG ACGTGATTCA 600
CAAGCGAAAC GTCTTGGAGC TAAAGCAGCT GACGGACAAA CTGTAACAGG TGGATCAATC 660
CTTTACCGTC AACGTGGTAC ACACATCTAT CCAGGTGTAA ACGTTGGTCG TGGTGGAGAC 720
GATACCTTGT TCGCTAAAGT TGAAGGCGTA GTACGCTTTG AACGTAAAGG ACG 773



(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



WO 97/37026 

(D) TOPOLOGY: linear 



PCT/US97/05306 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

CTGGATAATT GGAGCAATTC CTGATGCCAG TTTGAGGTTA CTGATAGGAT TGTGGGCGAT 60
AGCCACTTGA GAAGATGCCA AGAGTTCAAT TTCTCTCTCG TTTAATTCGA CCCCGTGAGC 120
AAATACGGAC GGATGATCTA AATAACCCAG TTCTTCAAGA AAAGCAAGGG GGCGTTTGCC 180
GTACCGTTTG AGGATAATTC CTGACTCCTC CTTGGTCTCC GCCACATGGA CATGGAGCGG 240
AATATTTAGC TCTTTTGCCA TTTCCAAACT CGCTTCCAGC AAGTCTCTAC TGCAGCTATA 300
CGGAGAATGA GGTGCTACCA TAACCCTTGA AATTTGGGAT TTTTATATTT TAAGATTTCG 360
TCTATGATGG ATCGAGTTCT GCTTATAGTC TCAGCAGTTG TTTCTGTCTC TGAAGAAAAG 420
AGAGTCGGAG AAAAATAACA ACGCATCTTG GAAGTTTTCA CCACCTGATA AATTTGCTGG 480
ATATCCACAC CATTGGGATT ATACATATCG TTAAAGGTTG TTGTTCCTGA CTGGAGCATC 540
TCTGTTAGGG CTTCTTTGAC CGCATTCGTA GTCATGTCGG GAGTAAACTC AGATTCTCCT 600
GGCCAGATAT AGTCATTGAG CCATTTCATG GAGATTGCTG TCATCTCGGA TCCCTCTCAG 660
ACCTGTCATT GCAGAATGGG TGTGACAATT GACCAAACCA GGCATAAGCC AAG 713

(2) INFORMATION FOR SEQ ID NO: 98:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

CTAGAAAATC TGCTTTTTTC TCGAAAAGTT TAATACTTGG TACAATATTA CGCTTGCTTT 60
GCTCGGCTGC TCCAAGGGCA ATTTTTTCTA GTTTTTCAAC TTTTTTACCC AATTTCTTGC 120
CATCCCACTT GGCAACTGAC CAGTCTGCAG GAAAGGCCCA GATTTGGCTA GCACCCAGTT 180
CGGTTACTTT TTGAGCGATG AACTCCAGCT TGTCTCCCTT GGGAAATCCA GATGCGATGG 240
TCACTTGGAC TGGTAGTTCC ACATTGTCAT TTAATTCTTG GACCAACTCA AACTGACGAT 300
TTTCCATATC CAGCACGCGC GCCAAGCGCT TGATGCCATC ATCAAAGACT AAGGTAACCT 360
CATCCTCTTC TTTCAAGCGC ATAACCTGAA ACATATGCTT ACTGGTTTCC TTGTCCTCGA 420
TAGTGACAGG AGAGATAGCA CTGCCTTTTA CAAAATACTG CTGCATGCTA GCCTCCAATC 480
ACACCAGAGA TATCCTTGGT TTTCTTAAAG ACACAGGTAT TCCATTCCCC TTGAACCATG 540
TGAGTTTCGA GGAAAAATCC AGCTGATTCA GCCGACTCGC GCACCATGTC CCACTTATGC 600
CTTGAATAAT GCCACTCATG ATCAGGTAGC CTTCGTCCTT AACCAAACGA TAGGCATCGT 660
CTATTAGATG AATGAGGATA TCCGCCAAGA TATTAGCCAC AATCACATCT GCCTCAATTT 720
CCACACCCTT AAGCAAATCT CCAGCCGCTA CATGGATATT TTCCATGCCA GGGTTGAG 778

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 



CTTCCCAGAT TTTTTCGTTG ACTTGCTGTT CAATCGCACG CAATTCTTCA GCAGTTACAG 60
CTTGGAAGTG GGTAAAGTCA AAGCGAAGGA ATTCAACTTC GTTAAGAGAT CCTGCCTGTG 120
TTGCGTGGTT TCCAAGGATA TTGTGAAGGG CAGCGTGAAG CAAATGAGTC GCAGTGTGGT 180
TTTTCATGAC ACGGTGACGG CGATTGCTAT CAATTGCCAA GGTATATTCT TGGTTCAAGG 240
CAAGCGGTGC AAGGACTTCA ACTGTATGAA GGGCTTGACC ATTTGGGGCT TTCTGAACAT 300
TGGTCACAGT AGCCACAACC TTACCTGACT CATCCAAGAT TTGTCCGTAG TCAGCTACCT 360
GTCCACCCAT TTCAGCATAA AATGACGTTT CCCGCAAAGA TAAGAGAGGC AGTTCCTTCT 420
GAAACAGCTT CTACTTCTGC ATTGTCCGCC ACGAATAGCT ACCAATTTAG AAGACAATTG 480
GCTAGCATTG TAGTTGAAGG CACTTTCTAC AGTGATGTTT TGAAGAGTTT CCATTTTGCA 540
TACCCATTGA GCCACCCTTG ACAGCTGACG CACGCGCGCG TTCTTGTTGT TCTTTCATGG 600
CTGCTTCAAA ACCTTCACGG TCTACAGTCA TACCAGCTTC TTCAGCGATT TCTTCAGTCA 660
ATTCAACTGG GAACCCATAA GTATCATAGA GTTTGAAGAC ATCTGAACCA GCGATAACAG 720
ATTGACCTTT TTCTTTCAAG TCTGCTACAA TGCCTTGGGC AAAGTGTTGA CCTTGAGTGA 780
AGGGTACGGG CAAATGATTC TTCTTCGCCT CTTAACGATT TTCTCAATAA AGTCACGTTT 840
CTCAAGCACT TCTGGGTAGT AGCTTTCCAT GATTTTTCCA ACAGTTGGAA CGAGTTTTGT 900
AAAGGAAAGG CTCGTTGATA CCCAATTTTT GACCCATGCA TAGAAGCACG ACGGAGAAGA 960
CGACGAAAGG ACATAACCAC GACCCTCCAT TTCCTGGAAG GGCACCATCC ACCGATGGCA 1020
AATGAAAGTG AACGGATGTT GGTCCGCGAA TGAACCTTGA AT 1062



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 865 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

CTCAAAGGTC GCACCTGCAA TACGACCCGC TACAGGACCT ACACTTGCTC CATGCTTGGG 60
ACTATTGCCT ACATAACTAC CAAAGTCATC AAATCCCAAG ATAACATTGG CAAAATTTCC 120
AGCCTTGTCA GGTGCGACAT AGCGCAAGAT AGTCGCACCA TAAGTCATAA CCTCAAGTTG 180
GTAGCCACCG TCTGTCTCAA ATCGATAGGC CAAGACATCC TCACCCTCAA CATTTCCAAA 240
TACACGCTCT GTGTATGCTT TCATTCTGTT CTCCTTTTAC TATTTCTCTC AAGCAAACAA 300
ACCATAGAAA GCGTACTGAC AATCTATGGT TTATCTGATA ATTTACAAAT CCTCTTGTCA 360
AGAATTCATA AACACTGTCT TACTTTTGAT ATTCGTGAAT TATGACACCT TGTACTACAC 420
GGTTTACTGT ACCTGTAGGA GACGGTGTAT CTGGTTTATT TTCTACCTTG AGTGAAGTCA 480
ATAGGGCAAA GAGTTGGGCA TAAACGATGT AAGGGAAGAC ACGGTAAATA TCATTCAAGA 540
CACCGCCACA ACCAAGGGCC ACTTCTTTGA CATTTTCAAG ACCAAAAGCT TGATCACTCA 600
AAAGCACAAC ACGACGAGCA ATCTGGTCAC CAGCAACTTC ACGAACCAAG TCCAAGTCGT 660
ACTTACGAGT GTAGTCCGTC CTTGTACCAA AGAACAAAAC AACTGTATTG TCGTTGATAA 720
GATATTTGGA CCGTGACGGA ACCAACTGGG CTTTCATACA TGGTCGCAAC TTGAACAACA 780
GTTAATCCCA AATCTTGAGC TGAGCCTCAT GAGCAGTCCA AAGAAGACCA GCGCCTAGAA 840
GAGACCGGTT AAGTCTTAAT CACAG 865

(2) INFORMATION FOR SEQ ID NO: 101:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:

TTTTGCTCCA GTTGATAAGA AAGGCTTGGT CGATGTTGAG GCGTTAGCAG GTTTGATACG 60
GCCTGATACA ATCCTCGTTT CCCATCATGG CTGTGAACAA TGAAATCGGC TCTATCCAAC 120
CNATTGAGGC TATTTCAGAA TTCTTGGCAG ACAAGCCGAC TATTTCCTTC CACGTTGATG 180
CGGTTCAGGC GCTTGCCAAA ATTCCGACTG AAAAGTATCT GACAGAACGG GTGGATTGCG 240
CGACCTTCTC GAGTCATAAA TTTCATGGTG TCCGTGGTGT TGGCTTTGTC TATATCAAAT 300
CTGGCAAGAA GATTACACCT CTTCTTACAG GTGGTGGCCA GGAGCGAGAT TATCGTTCGA 360
CAACTGAAAA TGTGGCAGGG ATTGCAGCGA CAGCCAAGGC TCTCCGTTTA TCTATGGAAA 420
AGCTAGATAT CTTTAGGAGC AAGACTGGGC AGATGAAGGC AGTGATTCAC CAAGCTCTTC 480
TGAACTATCC GGATATTTTT GTCTTTTCAG ATGAGGAAGA CTTTGCACCT CATATTCTGA 540
CTTTTGGAAT CAAAGGTGTT CGAGGTGAAG TCATCGTTCA CGCCTTTGAA GACTATGATA 600
TTTTCATCTC AACAACCTCA GCTTGTTCAT CTAAGGCAGG AAAACCAGCC GGTACCTTGA 660
TTGCCATGGG AGTGGACAAA GATAAGGCCA AGTCAGCTGT GCGTCTTAGC CTAGACTTGG 720
AAAATGATAT GAGTCAGGTC GAGCAGTTTT TGACCAAGTT AAAATTGATT TACAATCAAA 780
CTAGAAAAGT AAGATAGGAG CATTCATGCA GTATTCAGAA ATTATGATTC GCTACGGAGA 840
GTTGTCAACC AAGGGTTAAA AACCGTATGC GTTTCATCAA TAAACTTCGT AATAATATTT 900
CGGACGTTTT GTCTATCTAT ACCCAAGTTA AGGTAACAGC AGATCG 946

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

CTTGCTCGTA ACAGGTGCTA TCCTTGGTGT CAATGTTCAC ATCTTCTCAC CAAAAGAACT 60
CTTCCCAGAA AAAGAAATCG TTGAATTGGC AGAAGGATTT GCTAAAGAAA GTGGCGCACA 120
TGTTCTCATC ACTGAAGATG CTGATGAAGC AGTTAAAGAT GCAGACGTTC TTTACACAGA 180
CGTTTGGGTA TCAATGGGTG AAGAAGACAA ATTCGCAGAA CGTGTAGCTC TTCTTAAACC 240
TTACCAAGTC AATATGGACT TAGTTAAAAA AGCAGGCAAT GAAAACTTGA TCTTCCTACA 300
CTGCTTGCCA GCATTCCACG ATACTCACAC TGTTTATGGT AAAGACGTTG CTGAAAAATT 360
TGGTGTAGAA GAAATGGAAG TAACAGACGA AGTCTTCCGC AGCAAGTACG CTCGCCACTT 420
CGATCAAGCA GAAAACCGTA TGCACACTAT CAAAGCTGTT ATGGCTGCTA CACTTGGTAA 480
CCTTTATATT CCTAAAGTAT AATTTTAGAT AATAAACCGT CTACCAACAG CTATGAGGGC 540
TGCGACTAAT AG 552



(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:

CTAAAAAAAG TCAAGTAGAA AACGAATATT CTACTTAACT TGCACGAAAT TATTTTTCAC 60
GAATGACTTC GACCTTATAT CCATCAGGGT CTTTGACAAA GTAATAGTTT GGTGCAGTTC 120
CTGGTAGACC ATTTGGCTCA GTCACTTCAT AGCCTTTTGT ACTGTGCTCT TGATGAAGTG 180
CCTCAAGATC AGGTGTACTG AGGGCGATAT GGGCAAACCC ATCACCAACC ACATACGGAC 240
CGTGATCGTA GTTATAAGTC AACTCCAACT CATAGTCATC ACCCTCAAGA CCTAGATAGA 300
CAATCGTGAA GGCATGGTCT GGAAAATCTC TGCGACGCAA TTCTTTAAAA CCAAAAGCAT 360
CTTGATAAAA TGCAATTGAT TTTTCAAGAT TTTCTACTCG TAAGCAAGTG TGTAGCATTT 420
TTGAAGCCAT ATCTTTCTCC TTTATTTTTA AAAAGACTGG ACAATCCTGT TCCAGTCTCA 480
TCAGTTGTTA TTTACCAAGT TTTGCTTTAG CT 512



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

CTGGACCACA CTATTTTCTG TGTTGGCTAT CGTGTCATGC AAAAGGATCT AGAAGGGACG 60
CTGGATGCTG AAAAACTCAA GGCTGCTGGT GTTCCGTTCG GCCCGCTTTT TGGTAAAATC 120
AAAAACGGCC AGGATCTTGT TTTGGAAGAC GGAACTGAAA TCAAGGCAGC AGACTATATC 180
TCAGCGCCAC GTCCAGGTAA GATTATCACT ATTTTAGGAG ACACTCGAAA AACGGATGCC 240
AGTGTGCGTC TGGCTGTCAA TGCAGATGTC CTAGTTCATG AGTCCACTTA TGGCAAGGGT 300
GATGAAAAAA TTGCTCGTAA CCATGGTCAC TCAACTAATA TGCAAGCTGC ACAAGTAGCG 360
GTAGAAGCAG GTGCCAAACG CCTCCTACTC AACCATATCA GTGCCCGTTT CCTCTCAAAA 420
GATATAAGCA AACTCAAGAA GGACGCTGCC ACAATTTTTG AAAATGTCCA TGTGGTCAAA 480
GACTTGGAAA AAATGGAAAT CTACCAGTCA CAGAAAGGAT AAGTATGCCT ACTATTCTCC 540
ATTAACCGGA A 551



(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:

CCGTCATTAT ACCTTTGTCT ATGAAAATGA AGACCTAGTC TATGAGGAGG AAGTCTTATG 60
ATACAGCCAG CAAGTTTAGA AGAATTAGCA TCTTTAGTGG AAAAAGCGGG CAAGAAGGTC 120
TTCCTTTTTG TGGCAGACTG GTGTGGCGAT TGTCGTTATA TTTATCCTGC CTTACCAGAG 180
ATTGAGGAGA CCAATCCAGA GTTCACCTTT ATTCGAATGG ACCGAGATCA GTATATGGAT 240
TTGGCCAAAC TCTGGGATGT TTACGGAATT CCTAGCCTTG TTGTTCTAGA AAAGGACAAG 300
GAAATTGGTC GTTTTGTCAA TCGCGACCGT AAAAGTAAGG AGCAAATTAA CGATTTTTTA 360
GCAGGATTGA AATAGGAGAA AAAGGAAACA ATGATTTTTA CGTATAACAA AGAACATGTC 420
GGTGATGTCC TTATGGTCAT CGTGAAAAAT AGCGGAGATG CCAAACTGAA TGTGGAACGC 480
AAAGGCAAGG TAGCCCGTGT TTTCCTCAAA GAAAATGGGG AAACAGTAG 529



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

CTAGGATAGA CCGCTTTTTA GCATTTATCT AAGCATTCCA GAGTACATGT ATCTTGCATG 60
TGCTCTTTCT TTTGGGGTTG AAACGATAGG AGAAGGAAAT GTTAGAATTG CTTAAATCAA 120
TCGATGCTTT TGCTTGGGGA CCGCCCCTCT TGATTTATTG GTCGGAACAG GGATTTACCT 180
AACCATGCGG CTAGGACTCT TGCAGGTTTT GCGTCTGCCC AAGGCCTTTC AGCTTATTTT 240
TATCCAGGAT AAGGGACATG GTGATGTATC CAGTTTTACA GCTTCTTGTG TAACAGGCCC 300
TTGGGCATTC AAACTGGTTG GGAAACAGGG AAATATCCAT TAGGGAGTTG GCGACGGCTA 360
TCAAGGTTGG TGGACCAGGA GCTCTATTTT GGATGTGGAT GGCGGCTTTC TTTGGAATGG 420
CTACCAAGTA TGCGGAAGGA CTCTTGGCCA TCAAATACCG CACCAAGGAC GACCATGGTG 480
CAGTAGCGGG AGGTCCCATG CATTATATCC TTCTAGGGAT GGGAGAAAAG TGGCGACCAC 540
TTGCTGTTTT GTTTGCAGTA GCAGGAGTAT TGGTTGCTCT CTTGGGAATC GGAACCTTCA 600
CCCAAGTCAA CTCGATTACA GAATCTATCC AAAATACAAC GACGATTTCG CCAGCCATCA 660
CAG 663



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

CCTTGACCAC CAATTCTTCA CCCGATATGA TGCGCCCTGC TCGCTGAGCA AGCCCCAAGA 60 

GATTACTTAT CTTTTGCTTA TTCAAGTCCC AACTCTCTTC TTTTCACTTT GTGATCCACA 120 

TAAGCGATCA ACTCGTCATA AAAGCTTTCT TCCACTTCCA TGCTAAAGCT GCGGTTAAAG 180 

ACCTTCTTCT TTTTCGCCTC TAGGGCTTCT GCATTGTCTA GTTTGATATA AGCGCCGCGG 240

CCATTGGCCT TGCCCGTAGG ATCAATAAAG ACTTGTCCTT CCTTGTTCTT GACAATGCGG 300

AGCAAATCAC GCTTATCAAT CACTTCGTTA GACACAACAG ACTTGCGCAA AGGGATTTTT 360 

CTTGTTTTCA TCTTTCCCTC CTCTAGCAGC TTTTATTCTT CTACAGTATC GTTTTCTACT 420 

TCCAACTCTA CTGAAGCAGC GTCTTCCATG GCTTCAAATT CGCTAGCAGA CTTGATATCG 480 

ATACGGTAAC CAGTCAAGTG AGCCGCCAAG CGCACGTTTT GTCCACGACG ACCAATGGCA 540

AGAGAAAGCT TGTTATCTGG AACAACCACC AAGGCACGTT TGCTGTCGTT TTCATCAAAG 600 

ATAACTTGGT CAACCTCAGC AGGAGCGATG GCATTGTAGA TAAATTCAGC TGGATCTGCT 660 

ACCCACTCGA TAACATCGAT ATTTTCTTCG ATTGGTACCA TGCGGTCATT TTTAGCATCG 720 

TAACGAG 727 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 



CTAATGTAAA TCTATGGGAT AAACGTTAGC TTCTTTTAAT AGATTATTAA GCAATTTTTT 60
AAAACAACTC ATCAAACAAA CTCAACTGGT TATCTCTGGC ATATTTCCAA GAATACCCAT 120
CTCATCCATC TTTTCAACCA AGGTTGATGA GAGTCCACCA CGCTTGCGTA GTTCTGTTTT 180
AGAGAGGAAT TCTCCCTCTT CACGCGCCCG CACCAGTTGC TTGGCAACGT TCTCCCCCAG 240
ACCATCCATT GCTACAAATG GTGGGATAAG GGTATCCCCG TCGATGAGGA ACTCTGTCGT 300
CTGACTACAG TAGAGATCTA ATTTACCAAA CTTGAAACCT CGTTCCCACA TCTCATTGAC 360
AATCTCAAGA GTTGTATAGA GATCGATTTC CACATTAGAG GCTTCATTGT TCTTCCGTTT 420
TTCAGAGATT TCTTCCATTC TGCGCTTGAT GACCTCCAAG CCCGCACCCA TGGTCTTGAT 480
ATCAAAAGCC TTAGCACGAA TGGAGAAGTA AGCACAGTAG TAATAAATAG GATGGTGAAC 540
CTTGAAGTAA GCTACACGCA AGG 563



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 647 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:



C I AAL TTU AA 


AAGCTTTTAG 


AATATCTTCT 


TGACTAATCA 


TATGATTACC 


GCCCAAGGTT 


60 


AAG\I I I lt-Q,A 


AAATAGAGCC 


ATTAAAGATA 


TAGGCTTGTT


GGGGTAGGTA ATTAATATGA 


120 


CGGCGCAAGA CTTTTTTTAT CAATGTTTTT AATATCCTGA TGATTGATGG AAATATGCCC 180
TTTGTAGGGT TCAAAGAAAT TGACAATCAT TTTGGCTAAA GTTGTTTTAC CAGAACCACT 240
AACTCCAACT AGGCTAACCT TATCTCCTTG TTTAATCGTG AGATTAATAT CTGTTAAGGT 300
ATCTCGTCCA AAACCATACT TATAAGAAAG GTCATCAAAT TCAATATCGC CCATCAAAAA 360
ATGTGAATGA ACAGGGTTTT CTTGAACTTG AAATTCAGAT TCGACTAGAT AGACTTCGTT 420
CAAACGGTTA TTAGCGACCT TCGCAGATTG GAGTTTGGTT TGGAGGTTGA TAATATTTTC 480
CATAGGAGTT GTAAAGTAAG AAAGAAGTGT GTTAAAGGTA ATCAGCTGAC CGATAGAAAT 540
TTTACTCGAC ATGACTAATT GAGCGCCAAA CCATAGGATA AGGATATTCA GAACTAATTT 600
TGTTTCCCCT GCTTTAAACT CGTTTGTAAA ATAGAATATT TACTGAG 647



(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

CTATCCTNGA ACCAATGATG CTTGTAACAA TCACTGTTCC AGAAGAAAAC CTTGGTGATG 60
TTATGGGTCA CGTAACTGCT CGTCGTGGAC GTGTAGATGG TATGGAAGCA CACGGTAACA 120
GCCAAATCGT TCGTGCTTAC GTTCCACTTG CTGAAATGTT CGGTTACGCA ACAGTTCTTC 180
GTTCTGCATC TCAAGGACGT GGTACATTCA TGATGGTATT TGACCACTAC GAAGATGTAC 240
CTAAGTCAGT ACAAGAAGAA ATTATTAAGA AAAATAAAGG TGAAGACTAA TCCGTCCTCA 300
CTCTAGAAGG AAGTCACTTA GTGGCTTCCT TTTGTCTTTA GAAAATACCT CTAAATATGG 360
TAAAATAGTA GAAGAATAAT GTGAGGAAAA TGAATGTCAA ATAGTTTTGA AATTTTGATG 420
AATCAATTGG GGATGCCTGC TGAAATGAGA CAGGCTCCTG CTTTAGCACA GGCTAATATT 480
GAGCGAGTTG TGGTTCATAA AATTAGTAAG GTATGGGAGT TTCATTTCCG TATTTTCTAA 540
TATTTACCTA TTGAAATCCT TTTTACAATT AAAGAAAGTT TGAGCGAAGA ATTTCCTNAG 600
AANGCCATCA 610



(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:

TATAAGCATA GTACCAGTTG CCATTGACCT GGAACCAACT GTCTTCATAT CTCCATTACT 60
GCATTTAGGT AGTACCAAGT TGAACCATCT TGATACCAAC CAGTTGCCAT TGCTCCTGAT 120
GAACGGAGAT AGTACCATTT GTTCCCAAGG TTTTGCCAAC CTGTTTTCAT ATCGCCATTT 180
GGCTGGTCTA AATAATACCA AGTGGTACCT TCCTGATACC AACCAGTTGC CATTGCTCCT 240
GATGAACGGA GATAGTACCA TTTGTTCCCA AGGTTTTGCC AACCTGTTTT CATATCGCCA 300
TTTGGCTGGT CTAAATAATA CCAAGTGGTA CCTTCCTGAT ACCAGCCAGT GGCCATTGCT 360
CCTGAGGAAC GGAGGTAGTA CCACTTATTA CCTAGATATT GCCAACCTGT TTGCATAATA 420
CCAGTTGTTG GATCTAAATA GTACCAAGTC AAATCATCGT TTATCCACCC CGCACGTCTT 480
TCACCACCAA GGTAGTTTTC TCCATTAATT TCCGTCTTAG CTAGATAATA CCAGTAGACT 540
GATCATAAAG CCAACCTGTC TCTAAAGAAT GATTTTGATT AAAGTAATAG TTCGTATAAT 600
AACGNTTCTC TTCTTTATCT TCTGAATCTT CACGTTTTTC CCCGTACTTT CTTCCAACAC 660
TGTCTTTAGT TTTAATCTCT AATGTTTTCC AACCAACAAA CTCTTGTAGC ACTCCATTTT 720
TATCGAAGTA GTACCACTCT GAATTTGGAA AACCTTCTAA TCTCATACCA TTTGGGTAAG 780
GACCAATTGT ACTACCTTTA GATGGAAACG GGATATATTG CCAGCCGACA ACCATCTCTC 840
CAGATAGAGA ATCAAAATAA TAGTACTTAC CATCAATCAC TCGCCAGTAG GTTTCTTTGA 900
GGTCCCCCTT TTTGTAATAG GTTCTACCGT TTTCTTGGAC AAACTGCCAT CCTTCAGAGT 960
TATCTGCAAA TACTGTATTC GTAG 984

(2) INFORMATION FOR SEQ ID NO: 112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 842 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

CCTCGTTATG CAGATGAACG TTATTTCTTG TCAAAGAGTC ACAAGAATTT TGTTGATCGT 60 

AATCTTTTTA TTACCATTCG TGACAAGGAA ACCACCTGTA TCAAGCCTTA TCAGCAGGAT 120 

TTGGATTTGC CACATGGTCT GGCCTTGGAT GTTTTGCCTT TGGATTATTA TCCGAAAAAT 180 

CCAGCTGAGC GGAAAAAACN GGTTCGTTGA GCCTTGATTT ATTCACTCTT TTGTGCGCAA 240 

ACTATTCCAG AAAAGCATGG TGCTCTCATG AAATGGGGAA GTCGCATTTT ACTGGGTTTG 300
ACTCCAAAAT




1 LuLAl v, I w» 


A A ft, £ A Aorrr^ 








GATTTGGCTG 


IV r P r T'f ,, TV"* & TV"/"* 
Al lbl uA 1 w 


/"•uTT'iir* ap a & 


TTATVJP'Pr'Afl 




PATCACIAAAr 


420 


AAGTACCCAA 


i LALA1L 111 




/"•rprnmrp/^TTY^r* 
LI 1 X 1^1 


raTTTYiaai^r 






CCTAT1 CLAA 




Tp m A TfTP 
J. i.V_lrWLlL, 


AfiAAPTnCTT 






SAD 


CCACCAGCAG 


ACAAGCAGGT 


AC CGL ATL AL» 


VjA I Jjt, 1\j 1 t„ A 


1 CQjCTCjATAT 


GGATAAGTCT 


£ fl A 


TATACAGAAT ACAAGGGAGA ATATGGTGGC TAAGAAAAAA ATCTTATTTT TTATGTGGTC 660
TTTTTCTCTT GGAGGTGGTG CAGAGAAGAT TCTATCAACC ATTGTTTCAA ATCTGGATCC 720
AGAAAAGTAT GATATTGATA TTNTTGAAAT GGAGCACTTT GACAAGGGAT ATGAATCTGT 780
TCCAAAGCAT GTACGCATTT TAAAATCCCT TCAAGATTAT CGCCAAACCA GATGGATACG 840
AG 842

(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 749 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

CTGGCAAATA CAAGGTGACG ATCATTGGTA AATCAGCCCA CGGTGCTATG CCTGCTTCAG 60
GTGTCAATGG TGCGACTTAC CTAGCCCTCT TCCTTAGCCA GTTTGACTTT GCTGGTCCAG 120
CCAAAGAATA CCTTGACATC ACTGGTAAAA TTCTCTTGAA CGACCATGAG GGTGAAAGTC 180
TCAAGATTGC TCATGTGGAT GAAAAGATGG GTGCCCTTTC TATGAATGCA GGCGTCTTCC 240
GCTTCGATGA AACAAGTGCT GATAATACCA TTGCCCTCAA CATCCGCTAT CCAAAAGGAA 300
CAAGTCCAGA ACAAATCAAG TCAATCCTTG AAAACTTGCC AGTTGTTTCT GTTAGCCTGT 360
CTGAACACGG TCACACGCCT CACTATGTTG CCAATGGAAG AATCCACTTG GTTGCAAACC 420
TGGTTGAAAT GTCTATGAAA AACAGACAGG CCTTAAAGGT CATGAACAAG TCATCGGTGG 480
TGGAACCTTT GGTCGCTTGT TAGAGCGCGG AGTTGCCTAT GGTGCTATGT TCCCAGACTC 540
AATTGATACC ATGCACCAAG CCAATGAATT TATTGCCTTG GATGATCTCT TCCGAGCAGC 600
AGCAATTTAT GCCGAAGCTA TTTACGAATT GATCAAATAA AACGATAGAA GTCTGAGATC 660
TTATGCTTGG ACTTCTTTTT GGAGGGAAAG TAGATGTCTC AAATCGAAAG AATCAAACAG 720
GCTATCATGG CGGATTCACA GAATGCCAG 749



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

CTGGATGAAT TGCAGAAGCG CAACCTACTG GGATTTGTCT TCCAAGATTT TCAACTATTT 60
CCTCATCTAT CAGTTCTGGA AAATTTGACT TTATCGCCTG TGAAGACCAT GGGAATGAAG 120
CAGGAAGAGG CTGAGAAGAA GGCGAGTGGA CTCTTGGAAC AGTTAGGACT AGGAGGACAC 180
GCAGAGTCCT ATCCTTTCTC ACTATCTGGT GGGCAAAAGC AGCGGGTGGC TTTGGCGCGT 240
GCTATGATGA TTGACCCAGA AATCATTGGC TACGATGAAC CAACTTCTGC CCTGGATCCA 300
GAATTACGTT TGGAAGTGGA GAAGCTAATC TTGCAAAATA GGGAACTTGG GATGACCCAC 360
ATTGTGGTTA CCCATGATTT GCAGTTTGGC TGAAAATATC GCACATGTTA TTATTGAAAG 420
TAGAACCTCA AATAGGAAGA AAAATGGATT GAAAAAATGG ATGCTTGTAT TAGTCAGTCT 480
GAAGACTGCC TTTGTTCTTA GTA 503

(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 615 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:

CTGGAGGGCA TTCAGTATTA CTTGAATAGA GGAAAACATC ATCTCCCTTA TGTTCTCAAT 60
ATTGGATTTC CTGGTCAGAA AAATGACCTC TTACTCCTTC GGCTAGATTT AGCTGGAATT 120
TCAATCTCTA CTGGCTCAGC CTGTACTGCA GGCGTTGTCC AATCCAGCCA TGTTCTTAAA 180
GCCATGTATG GCGCAAATTC AGAACGCTTG AAGGAATCCC TTCGCATCAG TTTGTCGCCA 240
CAAAATACCG TTGAAGACCT ACAAACCCTC GCAAAAACCT TAAAAGAAAT TATCGGAGGT 300
TAGCCATATG GCATTTGAAA AAATCATTCA GTTAAAAAAT TGTCGTTACG ATTACACTCT 360
TAGCCCTTCT GTTAAAAAAT TCACCCTCAA AGATAACACC TTTTTTGAAA CTAAGGTTGG 420
TAACTATGAA CTGACTCGCC TTTTGGAAAA AGTGCCAAAC AGCGGTGAAG GCTTCCAACT 480
CAAAATCATC ATTAACAAGG AACTTACAGG GGCTAAAATC AATATCACTG ACAAGTTTGG 540
CCTTCGTCTA GTTGATATTT TCAAATCAGA AGACCACCAT ATTCATCAGG AAAAATTCTA 600
CTTCCTCATG GATAG 615

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 



CTGCAATGGT 


TAGCGAGGAT 


GGTGAAA1 AO 


AouAldl.1.1 In 


A\- i A 1 1 i loA 


buAA Ia^LAALj 


60


CTGACAGGAC 


TGGTATGTCT 


GCTCTTTGGC 


TCTG AAvjCjC. A 


QjAC- tatacc a 


GCTGGAAGTT 


120


TGACAGATGT 


GGCACGTGAG 


AATTATTGOA 






ATTCTCTCTT 


180


GATATTGGGT TTGGCAGAGT TGGATAGTGA GTTGGAAAAT TACCAAGCGG TTATTCAAGC 240
CTATGCCCAG TTAGATAATC GCTCGATTTA TGAGCAAACG GGCATTTCCA CCTATCAACG 300
AATTGGCTTT GCCTATGCTC AGTTAGGGAA ATTTGAAACG GCTACTGAGT TTTTAGAAAA 360
AGCCCTGGAG TTAGAATACG ATGACTTAAC AGCTTTTGAG TTGGCCAGTC TTTATTTTGA 420
TCAAGAAGAA TATCAAAAAG CCACCCTCTA CTTTAAGCAG CTTGATACCA TTTCTCCTGA 480
CTTTGAAGGC TATGAGTATG GGTACAGTCA GGCTTTACAT AAGGAACATC AAGTTCAAGA 540
AGCCCTGCGT ATCGCTAAGC AAGGATTAGA GAAAAATCCC TTTGAAACTC GCCTCTTGCT 600
AGCTGCTTCA CAATTTTCTT ATGAATTGCA TGATGCTAGT GGTGCAGAAA ATTATCTCCT 660
TACTGCAAAA GAAGACGCTG AGGATACAGA AGAAATCTTG CTTCGTTTAG CCACTATTTA 720
TCTGGAGCAG GAGCGTTATG AGGATATTCT AGACTTGCAG AGTGAGGAGC CAGAAAATCT 780
TTTGACCAAG TGGATGATTG CTCGTTCTTA TCAAGAAATG GACGATTTGG ATACTGCTTA 840
TGAGCATTAT CAAGAGTTGA CAGGAGATTT GAAGGACAAT CCAGAATTTC TGGAACACTA 900
TATCTATCTC TTGCGTTGAA TTGGGACATT TTGAAGAAGC AAAATCCCAT GCTC 954



(2) INFORMATION FOR SEQ ID NO: 117: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:

CTATGAAACA TTTTGATACT ATTGTCATCG GTGGGGGACC TGCTGGTATG ATGGCTACGA 60
TTTCCAGTAG CTTTTATGGA CAGAAAACCC TCCTCATCGA AAAAAATCGG AAACTTGGAA 120
AAAAATTAGC TGGGACTGGT GGGGGACGTT GCAATGTGAC CAACAATGGT AGCTTAGACA 180
ACCTGCTAGC TGGAATTCCT GGAAACGGAC GCTTTCTTTA CAGTGTTTTC TCCCAGTTCG 240
ATAATCATGA CATCATCAAC TTTTTTACAG AAAATGGTGT TAAACTTAAG GTCGAAGACC 300
ACGGACGCGT CTTTCCAGCC AGTGACAAGT CTCGGACTAT TATCGAAGCT TTGGAAAAGA 360
AAATCACTGA ACTAGGTGGT CAAGTTGCTA CTCCAAATAG AAATCGTTTC TGTTAAAAAA 420
GTAGATGACC AGTTTGTCCT TAAGTCAGCG GATCAAACCT TCACTTGTGA GAAACTCATT 480
GTCACAACAG GTGGTAAGTC TTATCCTTCG ACTGGTTCGA CTGGTTTTGG TCACAATGCT 540
CCGCCATTTA AGCATACCAT CACG 564

(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 651 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

CTCCGTGGTC TTTTGGTTTG CTACGGGATG TAGGCAAAAG GCAAGTCCAT TTTATCAGCA 60

ATAATGGCTC CGTGTGGAAT CCCTGCTGTT GCAGTTCCTG CAATCACTTC TACTTCAGGA 120 

AAGGCTTCTT TGATAGCTTC CACAAAACCA TTTTCAATTA GGGTACGAGT TTCTGGATAG 180 

GCTAGTGTCA CACGATTATC AGTGTAAATC GGTGACTTGA TACCAGATGC CCAAGTGAAG 240 

GGTTCCTCTG GTTTGAGGTA AACGGCTTGG ATTTTCAAGA GGTGGCTAGC GATATCTTTA 300 

GCAAGTGTCA TGGTATTCTC CTTTTATTTT TCTAATCTAG TTCTTTAATT CCAGTCCTGT 360

GTCCATTCAT CCTTGATGGC ATGATAAGCT GCAACAGGAT CCTCAGCTTG GGTAATGGGA 420 

CGTCCCACTA CGATATAGTC ACTGCCGATT TGATAGGCAT CAGCAGGTGT CATGACTCGT 480 

TTTTGATCTC CAACTGCAAC ACCAGCTGGA CCAATCCCTG GTGTCCCGAC AGATAAAAAT 540 

CTGGATTGGT AGCCCTGCTT GATGACTTTG TCACTTCCCT GAGCCGAGCC AAACCAACAC 600 

CCATCCCAAA GCCCAAGCTT CAGCTGTCTT CCTTGGCATA GTTGAATCCA C 651 

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:

CTTAGTATCT AGAAAAGGAG AAATAAAATG GTTAAAGTAT TAGCAGCGTG CGGAAATGGA 60
ATGGGTTCAT CAATGGTTAT CAAGATGAAG GTTGAAAATG CTCTCCGTAA GCTTAATCAA 120
ACAGATTTTA CAGTCAATTC ATGCAGTGTC GGTGAAGCTA AAGGTTTAGC AGTAGGATAT 180
GACATCGTAA TCGCTTCTCT TCATTTGATT CAAGAATTGG AAGGGCGAAC TAATGGGAAG 240
TTAATTGGGC TTGATAACTT GATGGATGAT AAAGAAATCA CCGAAAAACT CAGTCAAGCA 300
ATACAGTAAA AGGTTGGAGG GGGCTGGACA CAAACTGAGA GTTATCGTTT CTGTCCTTCT 360
CCCTCTTTAA ATAAAGGAGG CAGATATGAA TTTAAAACAA GCTTTAATTG ACAACGACTC 420
GATCCGACTA GGTTTAGAAG CTAACGAATG GAAAGAAGCA GTCAAGGTAG CAGTAGATCC 480
CTTGATTGAA AGTGGGGCAA TTTTGCCAGA GTATTACGAT GCTATCATTG AATCGACTGA 540
AGAGTATGGG CCTTACTATA TCTTGATGCC AGGTATGGCT ATGCCCCACG CTAGACCTGA 600
AG 602

(2) INFORMATION FOR SEQ ID NO: 120:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

ATTGGGAATT TTTGCAACTG CAATTGGTGC CCTCAGTAAT CTATAAAATA GATTCAAGAA 60
AATTTAGTGA CTGGGATTTC CCAGCCCTTT TTTAAAGTGA GAAGAAATAA TGAGTATGTT 120
TTTAGATACA GCTAAGATTA AGGTCAAGGC TGGTAATGGT GGCGATGGTA TGGTTGCCTT 180
TCGTCGTGAA AAATATGTCC CTAATGGAGG CCCTTGGGGT GGTGATGGTG GTCGTGGAGG 240
CAATGTGGTC TTCGTTGTAG ACGAAGGACT ACGTACCTTG ATGGATTTCC GCTACAATCG 300
TCATTTCAAG GCTGATTCTG GTGAAAAAGG GATGACCAAA GGGATGCATG GTCGTGGTGC 360
TGAGGACCTT AGAGTTCGAG TACCACAAGG TACGACTGTT CGTGATGCGG AGACTGGCAA 420
GGTTTTAACA GATTTGATTG AACATGGGCA AGAATTTATC GTTGCCCACG GTGGTCCGTG 480
GTGGACGTGG AAATATTCGT TTCGCGACAC CAAAAAATCC TGCACCGGAA ATCTCTGAAA 540
ATGGAGAACC AGGTTCAGGA ACGTGAGTTT ACAATTGGAA CTTAAA 586



(2) INFORMATION FOR SEQ ID NO: 121: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

TTCGACAATC TTGCCCNNGA TCCAAGACAA CCACTTCTCT GTCCGCTCAG CAATAGTCAA 60
GCGGTGAGCA ATGAAAATCA AAGGTCTTGT CCAAAGCCAT GAGATAATGG ACAATCCGTT 120
TTTTTGTCAA AATATCCAAA CTGTTAGTCG CCTCATCCAA AATCAAGACC GGCGCATCTG 180
TCAAGAGAGC ACGCGCCAAA GCGATTCTCT GACGTTGACC ACCTGAAATC CCTCCCCCAT 240
CCGAAGTCAA TTCTGTCTGG TAAATTTCAA GGTGGCATGG CGGCTTCGAG ATATTCCTCT 300
TCGGAATCTC TGAACCAAAT TCGGACCGGC CCGTTAAAGA TATCTTCCCT GTGTCGTCCC 360
CTCCCTTGGC TCCCAAAAGA AGATTCTCCA AAATCGTTCC GTTAAAGACA TAGGGCTGTG 420
GAGGCAGATA GATGATGTAC TGGCGTAGGG CTTTTTTATC AATCTGATTG AGATTGACAC 480
CACCCAGACT AATCTCCCCT TGACTTGGGT CGTAAAAATT AACCATCATC TTGGCCAAAG 540
TCGTCTTACC TGACCCTGAA ATCCCCACAA AAGCCACCTT AGACCCTTGG GGAACGGTTA 600
AATTGATATC CGACAAGACG TCTCGACCAT AGCCATACTT GTAATGAACC TGCTTGAAAG 660
TCATCTCTCC CTTCATCAAG 680



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 601 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

CCAATTTGGA TTTGCGAGAA GGCGTTATTA TTTCAGATGC TGGTTCGACC AAGTCAACCA 60
TTGTGGATGC GGCGGAGCAG TATTTGGCTG GCAAGTCTGT TCGCTTTGTC GGGGCCCATC 120
CCATGGCTGG TAGTCACAAG ACAGGGGCTG CTTCGGCAGA TGTCAATCTT TTTGAAAATG 180
CCTATTATAT CTTTACAACC TTCCAAAGCC CTGAACAAGT CAAGGACAAC GCTTAAAGGA 240
AATGGGAAAG GATCTGCTTT CCAGGTCTTC ATGGCTCGTT TTATCGAGAT TGATGCCAAG 300
GAGCATGATC GTGTCACTTC TCAGATTAGC CATTTTCCTC ATATTTTGGC TTCTAGTCTC 360
ATGGAGCAGA CTGCGGTCTA TGCTCAAGAG CATGAGAATG GCAAGGCGCT TTGCGGCAGG 420
TGGTTTTCGA GATATGACCC GAATTGCGGA AAGCGAGCCA GGAATGTGGA CCTCCATTCT 480
CTTGTCCAAT AGCGAGACCA TTCTGGATAG AATTGAGGAT TTCAAGGAAC GTTTGGAAGC 540
GATTGGTCAG GCCATTAGTA AGGGAGATGA AGAGCAAATT TGGAACTTTT TTAACCAAGC 600
G 601

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 514 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

V» ± X V— X \~ AGGGGCAGGA TATTTATGAT TTCCCTCGTG CTATGATTAA fiGAAGATAAT UU/lAUn X Art 1 60
CTGGAGTTTT CATTCTCTGG TTTGAAATCT GCCTTTATCA ATCTTCACCA CAATGCCGAG 120
CAAAAGGGAG AAAGTCTGTC TACAGAGGAT TTGTGTGCTT CCTTCCAAGC AGCTGTACTG 180
GATATTCTCA TGGCAAAAAC CAAGAAGGCT TTGGAGAAAT ATCCTGTTAA AACCCTGGTT 240
GTGGCAGGTG GTGTGGCAGC CAATAAAGGT CTCAGAGAAC GCCTAGCAGC CGAGGTTACA 300
GATGTCAAGG TCATCATTCC ACCTCTGCGC CTCTGCGGAG ACAATGCAGG TATGATTGCT 360
TATGCCAGTG TCAGCGAGTG GAACAAAGAA AACTTTGCAA ACTTGGACCT CAATGCCAAA 420
CCAAGCCTCG CTTTTGATAC CATGGAATAA AGAGTGGCTC TTTGTCAAGT GTAGTGGGTA 480
GGCGAAAAGC TACAATCTGG AGANTACGAA ATTC 514



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 



CTTGGATTAT AAAATACAAT CGGCAAGATC ATAAGTCCCA AGCCTAAAAA TATATAGAAA 60
GGGGGTCACC TTCCAAAGAA ATTCTGTATT AAAGAGCATG ACCACAAAAC CAATCACAAG 120
CCCCAAGGCA ATCCAGGCGA CCTGCTGCCC TAAAATGGGC AGAATATTAT TGGGGTAATC 180
ATGACTAACG GCTATATAGA TAGCCACCAC ACCGATGACC AGTAGAAAAA ATACTGGCAA 240
GAGCAAACTG TAATCGACTC TTGAGTCGAG AGAACGTTTC ATATAAACTA ACCTTATACT 300
TTCATACAAT ACTATTTATC AAAGTTCATT AAAAAATCTA TCAATAGCCT CGTCAACTTC 360
GGATCGAGAG ATGGTTTTAA CAGTCGCTTC TTCTGCTAGA GATGCTACTA TTTGTTTGCC 420
GTATCGTTTT CCGACGATTC TCCTATCCAA AATAAGAGTT AAGGAACGTT GGTATTCACG 480
TCTCATACTT CTTCCCAAAG CCTGTTTTAA ACGAATAATG G 521



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

CTTGCCTGAT GACATCATTG ACTCTTTTTG GTATATCATC GACCATTTCT TAAAAAATGT 60
CTTTGAATTG GAAGAAGAAC TCGAGTTTCA ATTGCTTAAT AACCAAGGAA AGATTACCTT 120
CCACTTTTCA AGTCAACACC TCCCTACAGC CATTGATTTT GACTTTAACC ATCCTTTCGA 180
CCCTCGTTAT CCCCCAAGAG TACTGGTTTT AGACATGGAC GGTAGAGAAA CTATCCTCCT 240
CCCAGAAGAA AATGACCTAT TTTAAAAACT CTAGCCTTCA GTTGCAAGTG ACTGAAAACT 300
AGAGTTTTTC TATTTTTTCA AAGCATCATA CAAGTTGCGG ATCGGTTGTT TTAATATCGG 360
ATGGATAAAA TGAGGCGCAA TTTCCTGTAA GGACTCAAGG ACAAAAAGGC GTTCCCGCTA 420
TGTTAAGGGA TGAGGCAATA TGAGGTCGTC TGTATAAAGG ATCTGGTCCT CCCACAAAGA 480
GCAAGTCCCA AATCAATCCA AACGAAGTCC CCCAATGCAC TTCTCTCACC CGTCCCCAGC 540
TCTTGACTCC AATGGCTAAC AAGGTTTCNT AACAATTCTT GTTGCTGGTA GCCAAGTTC 599



(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 564 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

CTATGGGTAA AAAGGATGCC TCTGCTATGC ATGAGATGAG GGCTTCCTTT ATTCAAGGCT 60
CCATAGAAGC AGGCCATACT GCGGAAAAAT CAGAGCAGGT TTTTGATGTT ATGGAGAAGT 120
TTGCAGGTTA TGGATTTAAT AGATCTCATG CCTACGCCTA CTCAGCCTTG GCCTTCCAGT 180
TGGCTTATTT CAAAACGCAT TATCCAGCCA TTTTTTATCA GGTCATGTTA AATTATTCCA 240
ACAGTGATTA CTTAATAGAT GCACTTGAAG CAGGTTTTGA AGTAGCCTCT CTATCCATCA 300
ATACTATTCC CTATCACGAT AAAATTGCCA ACAAGTCTAT CTATATAGGT TTGAAATCGA 360
TTAAGGGGCT CCAGCAAGGA CTTGGCGCTT TGGATTATTG AACATAGACC TTATTCTAAC 420
ATTGAAGATT TTATAGCTAA ATTACCTGAG AATTATCTGA AACTTCCTCC TGCTAGAACC 480
TTTGGTAAAA GTTGGTCTTT TCGATTCATT TGAAAAAAAT CGTCCAAAAG TATTTAATAA 540
CTTAGCTATC TATTGAATTG TGAA 564

(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 618 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:







CTTCTGACTA TCTACCTCTA CTAGATAAAG V* Own 4 ****** VJ GACAAACTGT 60
AAGATTAAAA CTGGAGAAGN TTGGAAATCA CGGCATTACC ACCATCGGCC AACTTCAGAC 120
AATTGATCAA ACTCCTACCA GAACAGAGCA AGGCAATCTC TTTAAATTAA CCGCTCTTGC 180
AAAACTATCT AATGAGGATA GTAAACTCAT CCAATATGGC TTACAAGGTC GCGTCACTAG 240
TGTAACTGCA AAGAAAACAT ATTTTGATTA TTTCAAAGAT AAAATTTTAA CCCATTCTGA 300
TTAATTTTCA GATAACACTC TATAACTATT TATTATCTTA TCAAAAAGGA GAATCATAAC 360
ATGGATAAGA AACAAAACCT AACTTCATTT CAAGAACTAA CAACTACCGA ACTCAACCAA 420
ATTACAGGTG GAGAATGGTG GGAAGAACTC TTACATGAAA CAATTTTAAG TAAATTTAAA 480
ATCACAAAAG CACTTGAACT ACCTATTCAG CTATAAAAAC AAGACCGAGA AACAAGAACT 540
CTCGGTCTTG TTTTTTATCA TTCTGCATGT ATCACAGTAA GTACCTGACG AAAGACTTGA 600
TTTTGGCGAG GTAGTATT 618



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



CAGGGGATGC CAGTTGATTT GCTTCAAGTT GACTTGACTC GTACTTGGGA AATCCTCGGA 60
GAAATCACTG GGGATGCTGC TCCAGATGAA CTCATCACCC AACTCTTTAG CCAATTCTGT 120
TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA TGGATTTTAT TGTCTTTATT 180
AGTAATCTGG TCTTAAGACC CCTGTTACAG TTGCCTTAGT TGCTTCGTAG TCGCCATCTA 240
CGACAACCTT GATAATGCGT TTGACATCTT CTTCTGGTGC TGGAACAAGA GGTAGACGAG 300
TGGGTCCAGC TTCAAATCCC ATATAGTTAA GAATTGCCTT AACTGGAGCA GGACTTGGAT 360
AAGAGAAGAG AGCATTAACC TTAGGAATGA ATTTACGCTG AATTGCTGCG GCTTTCTTCA 420
TATCGCTTTC TGCAATGGCA GTAAACATCT CGTGCATTTC ATCCCCATTT GTATGAGAAG 480
CAACAGAAAT AACCCATCCG CCCCAAGGTT CATGGCATGG AAAGCATCTC CATCCTCACC 540
TGTATAAAT 549



(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 455 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

CTGGCTANCA AAATCATACT CATAATCCTT AACACCGACA GAGATAGAAG CAATGAGCCC 60
TTGATCGTGC ATTCGTTTAA TAAAAGGAAT GCGTCCTGCC TCATCAAAAC GGTGCATAAG 120
TGTACAAGTA ACCACCTTTA GCCAGTTGCT CTGCTACATT TTCATCCAAA ATCGTCTGCA 180
TATTCGCTGG CACAACAGGT AGTTTAAAGG TGTGATTTCC TAAAGTGACA CTTGTATCCG 240
CTTCTGCACG GCTTTTAATG ACACATTTAT TTGGAATCAA TTGAATATCT TCGTAATCAA 300
AAATTGGAAA TTCATTTAAC ATATCGATGT CTCGTTTCTT TTGTAATGAC CTACCTATGC 360
TCTCGCATCA CTACGCCTTT TCCGACGTTT CCCTTAAATT TATTATAAAC CAAAAGTACA 420
GTTTTTGTCA AATTATTTCA TGAATNAAAT ATATC 455

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 776 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:

CTCCTGGGTT TTGATTAACT CTAGGGTTGC CAAAAAGAGG GTGATGACTT CTTGGACATT 60
CTGGGCTTCC TTGAACAAAT CCTGCAAGCG CAATTGATTT CGTCCAATCA AGGATTCTTT 120
CACAATAATC ATCATGTCCT CAATCTTATA CTCATCCCGC AAGATAGTCG TGTGATTTTG 180
TGCAAACTCC TCTTTTTTCT TGGCTAGGAT ATTTGAAAAA GCCAAAAAGA GGTCAATGGT 240
CGTCTTGTCA TGCACAAGCT CCGCATCTTC GTAAATCAAC TCTATCGGCG CTTTGGAATA 300
ATACTGGGCC CGTTTCTTGG TGCTTGGCTT CCAAGTGCTC ACCCAAGAGC TTGAACTTGC 360
GATATTCTTC GATTTGAGAG AGGAGGTCCT GCTCCAGGTC ATCCCCCAAG TCTGTCACTT 420
CTGCTACCTT CGGAAGGAGT TTACGACTCT TAATCAGCAT GAGCTGACTA GCCATGACCA 480
TGTACTCACC CGTCACTTCC AGACGCATGG CCTGCAGGGT TGAGACATAG GCTAGATACT 540
GTTCGATGAC TTCCGTAATG GGCACATCGT AGATATCCAT CTGGTACTTA GAAACCAGAT 600
GCAAGAGTAA GTCCAGGGGT CCTTCAAAAT CTTTTAATTT AATATCCATT ATCTATATTT 660
TTCTAAGGTC AGGACTGTTT TTAATCCTAA TTTTTTTGCA ATTTCGTACA AATCGACCTT 720
GTTTTCTATT TGTCCTTAGA ATAAACTGTT CACGTAAGAC TTGAGATCGA ATTCCT 776



(2) INFORMATION FOR SEQ ID NO: 131:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 



ATTGATACCN ATTGTACAGG ACAACTTATT TTGGAGCGCC ATAGCAGGGA TANGGTTAGC 60
AATAGATGCG NTGACCATAG AAACGTTTNT TAGCCAATAC ATAACCTGCT AGAGATGAGG 120
TTGCACAAAC TAAGAACATG GTTACCAATG AGATAAATAC TGAGTTCCAC ATCCATTGCA 180
AGGCAGGGTT CTGCACCATG AGTTGTTGGA AGTTTTCCAT GGTTGGCATT TTAGGGAACC 240
ACTGAGGAGG AATAACAATT GTATCAGGTT GTGATTTGAA TGCCCCTGTC AAAATCCAGT 300
AGAATGGAAA GATGNACAGC ACAGTCAACA AGAGCAAAAT GATTGTTGAA ATNACAGTAA 360
AGGCTGTTAA TGGTTTTTTT TCTGTAGATT GCATAGCTGT CTCCTTTCTT TAGTATTCTA 420
CGTCGTTTCC AAGTACTTTA AATTGAACAA AGCTTACGAT AGCAATCATG ACTGCCAAGA 480
AGACACCAAT TGTGTTGGCA TAGCCGTATT CTGTCAATTG GAAGGCTTTT TCGTAAAGGT 540
AGTACATCAA GGTACTTGTT GAGTAGTTTG GACCACCAGA TGTCAAAAGC TGAATCAAGG 600
CGAAACACTG GAATGAGTTA ATTGTTGTGA TGATTGCAAT ATAAAGAGTT GTTGGAAGAA 660
GGCTTGGCCA TTTAATCTTC CAAAAAACTT GAAACTCAGT TGCACCATCA ACACG 715



(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 



CTGGGCTGAG TTTTCCAGAT ACAAATACTG CAACCGCAAC TGCAATCCCC CAACCCATAG 60
TAATCACAAT CCAACCTGCA CTGTTGCTCT TGGTTTTTGG AAGAACCACA CCTGCAACAA 120
CACCATTTCC TAGAAGAATC AGGATTAAAG TCCCTAAAAA TTCTCCAAAT AATTCACTCA 180
TCATTTTTCT GTCTCCATTA AAAAGAAGGG GCGGGCGACA AGGATTGCTA CCCTCCACCT 240
CTTTTATTTT TTCTTAATTT TTTAATTCTG CTAAGTCGTT TTGAGCAAGA GCTGCTTTTA 300
CATCAGCACG GTAAGTTGCT TTTTCTTCTT CTGTCCAGTC ATAGAATCGT CCCATTTCAT 360
CCAAAACTGG CTCAACGATA CTATCCAAGC TATCACGCAT AAAGAGCATG TGATTGGTAC 420
GACGAAGAAG GAAGTCAACT GGGCTAAGAG TCAACTCATT GCGCATTGCA TAGTGAAGGG 480
ACAAG 485



(2) INFORMATION FOR SEQ ID NO: 133: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 670 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133:

CTTAATTATA ACTTAAACAC AAAAGTTTTA CACAAACTGT GGATAACTCT TTTGAAACTG 60
TGATTTTCTT AATTGAAATC TATTTTTTAT TTTGTGAATA AGATGTGAAA AAATAGAGAA 120
TATGTTAGAA TAGAGTCATG AAAATTAAAG TTGTAACAGT TGGGAAACTG AAAGAAAAGT 180
ATTTAAAAGA TGGTATCGCA GAGTATTCAA AACGAATTTC TAGATTTGCT AAGTTTGAAA 240
TGATTGAGTT ATCAGATGAA AAAACACCAG ATAAGGCCAG TGAATCAGAA AATCAAAAGA 300
TTTTAGAAAT AGAAGGTCAG AGAATTTTAT CAAAAATTGC TGACCGTGAT TTCGTTATTG 360
TGTTAGCCAT TGAAGGGAAA ACTTTCTTCT CAGAAGAATT TAGTAAGCAG TTAGAAGAAA 420
CTTCTATAAA AGGANTTTCT ACTCTTACTT TTATTATTGG GGGAAGTTTA GGATTGTCAT 480
CATCTGTAAA AAATAGAGCC AATCTTTCTG TCAGTTTTGG TCGCCTAACC TTGCCTCATC 540
AGTTAATGAG ACTAGTTCTT GTTGAACAAA TCTATCGCGC TTTTACGATT CAGCAGGGAT 600
TCCCCTACCA TAAATAGAGA ATTGACTTTT AATTGAATTT TTGGTAGAAT AATTGTGTTA 660
GGTCTCATAG 670

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 506 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:



CTATAATTTT ATTAGCTGTA TTCCAGTCGT CACGACCAAA CTCTGTTACA GGGACACGAA 60
TGTCAAAACG GTTCTCAATC TCCACAATCA ACTCAACCGT TCCCATACTA TCCAAGACAC 120
CTGCATCAAA AAGATCTTCA TCCATCATGT CAGAAACATC TTCCATAAAC AACTCATCAA 180
TAATTTCGAT AACTTCTGAT TTGATATCCA TATTTTATTT CCTTTTATTT TTTAAACCAT 240
AGATTATTCA AGAATCCAGA AAAGATTAAG AATGACAACA TGACAACATG GAAAGTGACA 300
ACCATGCCAA GCAACTGAAT CCAGCGATTC TCAGGTAGGG CAGCCTTCCC TGCTTTTTTC 360
CGTTCCTTAT TGAGCGTTTT TTTCTTGCGA ACCCAGGCAT CATTGATGAA CAAGCCTAAG 420
TCCATGAAAG AGTCCCATAG GCGATATAGT ACCAGGTCAC ACCATGCCAA AATCCCCATA 480
ATCCAGCATA TTTACAATGT AAGATC 506

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 921 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

ATGGTTCGAC CCACAAATAT GTTTTGCGAA TCATTGACCA TTCTATGATT TACGTTGCCA 60
TTGCCGGCTC ATACACGCCC GTTGTCTTGA CCTTGATGAA TAACTGGTTT GGCTATCTGA 120
TTATTGTCAT CCAATGGGGA ACGACCATCT TTGGTATTCT CTATAAAATC TTTGCTAAAA 180
AGGTCAATGA GAAATTTAGC CTTGCTCTTT ACCTGATTAT GGGCTGGTTG GTTCTGGCTA 240
TCATTCCTGC CATTATCAGT CAAACNACAC CCGTTTTCTG GATTCTCATG GTAACTGGCG 300
GACTCTGTTA TACAGTTGGA GCTGATTGAT AGCCTTTATG CATTGAAATT TGACAGTTTG 360
ACTTATGAAA ATAAAGCAGA GGTTATGGAC TTTATCAAGG CTCGTGTTGA TAAGATGATG 420
GGCTCTACTC CAAAAGATAT CAAGGGAAGC AGTTCTTGCA GGTTCAAACT TTGTTGTGGC 480
AGATATGTTT GGAAGCAGCA AGTGCTCTCG TAGAAGTAAG CAAGGAAGAA GATTTTAAAC 540
CATCTGTTGA ATCACTTTCT CGTGCCTTTA ACCTGGCTGA GAAGGCAGAA GGGGTTGCTA 600
CAGTTGATTC AGCACTATTT GAGAATGACC AAGAAAAAGC TTTGGCAGAA GCAGTAGAAA 660
CACTCGTTTT ATCAGGACCT GCAAGTCAGC AATTGAAACA ACTCTTTGCG CTTAGCCCAG 720
TCATTGATGC TTTCTTTGAA AATACTATGG TAATGGCTGA AGATCAGGCT GTCCGTCAAA 780
ATCGTTTGGC AATCTTGTCA CAACTAACCA AGAAAGCAGC TAAGTTTGCT TGTTTTAACC 840
AAATTAACAC TAAATAAAAT TTGATAAACG GACTTTATCT TATTACAAAG GAGAAGAAAT 900
GGATCCGAAN ANATTGCTCG T 921



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

CTTGGCAACA GGCTCTTTTT GTATTGCGTT TATTAAGAAA AACAAAAAAC ATCGATATAA 60
CATTTAACTA AAAATTATTG TATGTTATCT CCCTTAATTA GGAATGATAA GGGAATAACT 120
AGAAAGATTT GTGAATACAA ACTATTTCTG ATATACAAAA TATACAGTAA TAATGAATGA 180
TGGGAGATGG GATGAAAGAA TTTCAATTTG AGAGAAAGCA GCGTTTTTCT TTGAGGCCAT 240
ATGCAATAGG AGCTTGTTCG GTCTTGCTAG GAACGAGTTT ATTTTTTGCT GGTATGGGTG 300
CTCAGCCTGT ACAGGATACA GAAACGAGTT CAGCACTAAT TTCAAGTCAT TATTTGGATG 360
AGCAGGATTT ATCTGAAAAG CTGAAATCTG AGTTGCAATG GTTTGAATTA GAAAACAAGC 420
TTTTGAACTT ATGGGAACAT TAGGTTACTA TGAAGGATTT GTTCCTTATG TTTCAAATCA 480
ATACAAAAAC CAAGCTGAAG AAGAAGGCAA ACCGCTATCT GATAAATATA TTTTCGAAAA 540
ATCTTAAGAA AAACATATGC ACCTTCAAAA A 571

(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 611 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

GTATCAACTC CAAAGAGCAA GTGACTGTCA AAGTTGTAAC AGATGCGGCC AAGAAGCTCA 60
TGGGTGATAA GATTGCTCGC CAGAAAGAAC GTGGGATTCA GATTGANACC TTGCGNACCA 120
TGATTATNGG GATTCCANAC GNTGGTAAAT CCACTCTGAT GAACCGTTTG GCTGGTAAAA 180
AGATTGCTGT TGTTGGAAAC AAGCCAGGGG TCACAAAAGG TCAACAATGG CTTAAAACCA 240
ATAAAGATCT GGAAATACTT GGATACACCG GGGATTCTCT GGCCTATAGT TTGAGGATGA 300
AACTGTTGCA CTTAAGTTGG CATTGACTGG AGCTATCAAG GATCAGTTGC TTCCTATGGA 360
TGAGGTTACC ATTTTTGGTA TCAATTATTT CAAAGAACAT TATCCAGAAA AGCTGGCTGA 420
ACGCTTCAAA CAAATGAAAA TTGAAGAAGA AGCGCCTGTG ATTATTATGG ATATGACCCG 480
CGCCCTCGGT TTCCGTGATG ACTATGACCG TTTTTACAGT CTCTTCCGTG AAGGAAGTCC 540
GTGATGGCAG ACTCGGTAAC TATACCTTAG ATACATTGGA AGACCTCGAT GGCNACGATT 600
GAACTAAGTC C 611

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1054 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: Genomic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:



TTGCTAAAGC AGCCATGATG ANGGCACTCC ATAAGGATAA GTACCACCAA GCAGCAGCAG 60
ANGACCATAA TCTCCTTTAT GACTTGAACG AGAACGTTCA ATAATAACTT TTTCTAGTAA 120
GGTTTGATTA ATCACTTTCA TCCTTTTTCC CTCTCACTTT TATTATACAA CAAAAAGGAG 180
ACGCAGACCT CCTTTTGTAA TCTTATATCT AAAATTTAAT ATTCATTTCT GCCATTTTAG 240
ATATAGCTAT AGAAAATACA CTCTATTAAT CGAATGTTTC TCTTATTTTC TATCCAATGT 300
CCGAAGTGCT GCTTGATAAG TTTGCTCCAT CAGCATGGTA ATGGTCATAG GACCGACACC 360
TCCAGGGACT GGCGTGATAT GGCTAGCAAG TGGTGCAACT GCCTCATAAT CAACATCTCC 420
ACAGAGCTTC CCATTTTCAT CTCGGTTCAT CCCAACGTCA ATGACAACCG CACCTGGTTT 480
GACAAAGTCA GCAGTCACAA ACTTGGCGCG GCCGATTGCG ACTACAAGAA TATCTGCTTT 540
AGCAGCCACC TTGGCAAGAT TATGAGTTCG TGAGTGGGCC AAGGTTACTG TCGCATTTTT 600
AGCCAAAAGA AGCTGAGCCA TAGGTTTTCC AACGATATTT GAACGACCGA TTACGACCGC 660
ATTTTTACCT TCCAAGTCAA TCCCATATTC ATGAAACATT TCCATAATTC CTGCAGGTGT 720
CGAGGGAATC ATGACTGGAT GTCCAGACCA AAGACGTCCC ATGTTTAGGG GATGGAAACC 780
ATCCACATCC TTTTCTGGGT CAATGGCTAA TAAAACCGCC TCTTCATCGA TATGTTTTGG 840
TAATGGCAAC TGGACCAAAA TCCCATGCCA AGCTGGATCC CTGATTATAT TTAGCAATCA 900
GGTCTAACAA TTCCTCTTGA GTTAATGGTC TCTGGAACTC GCACTACTTC GCTACGGGAA 960
CCAGCCGCAA GAGCTGACCT CTCCTTGTTG CGAACGTTAA ACTTGGCTGG CTGGATTATC 1020
CCCAACCAAA ATCACTACCA ACCAGGCACT AGAG 1054



(2) INFORMATION FOR SEQ ID NO: 139: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

CTGAACTGGA TTTTGCTACA AGGTCTGATT TGTCAAAGAG ATGCGAGTTA TGATATGAAG 60
CAGGATGATT TGGATAAGGT AGCAGATTAT CTCTTCAAAA CAGAAGAATG GACCATGTAT 120
GAGTTGATTC TTTTCGGTAA CCTCTATAGT TTCTACGATG TAGACTATGT CACTCGGATT 180
GGTAGAGAAG TTATGGAGAG GGAGGAATTT TACCAAGAGA TTAGTCGCCA TAAGAGATTA 240
GTGTTGATTT TGGCCCTCAA TTGTTACCAG CATTGTTTAG AGCATTCTTC TTTTTATAAT 300
GCCAACTATT TTGAGGCTTA TACAGAGAAG ATTATTGACA AAGGTATTAA GCTTTATGAG 360
CGTAATGTTT TCCATTATTT AAAAGGTTTT GCCTTATATC AAAAAGGACA GTGTAAAGAA 420
GGCTGTAAGC AGATGCAAGA GACCATGCAT ATTTTTGATG TGTTAGGTCT TCCAGAGCAA 480
GTTAGCCTAT TATCAGGAAC ACTACGAAAA ATTTGTCAAA AGTTAATTTT CCCAAATAAG 540
GGAAAAAATA AAAAGCTCCT TTCGGTTTTG ATACAATAGT TTCAAAATTT GAGAGGAG 598



(2) INFORMATION FOR SEQ ID NO: 140: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 



CTACTTCCCG GCCGAGTCGA TTCTCCGCCG ATATCGAGCA TACTGGCTCC TTCTGTTATC 60
AATTTACGAG CCTGCTGGAG CGCCTGCTCA AGAGCAAAAA ATTGACCACC GTCCGAAAAG 120
GAGTCTGGGG TTACATTGAT AATTCCGCAA ATAGCTGTCT TTGCATGAGT GGCTTTACTG 180
GACATATCGG TCACTCCCTC AAGGCTATTC ATCATATTAT TTCTCTATTT TACCATAAAA 240
AGAAAAAGAT GGACACGATT TCATTCATCT TTCTCCCAGT AGAAACAAGT AAGCAATTGT 300
CAATAATCTT AAACAGAAAT CCCTAATGTC CGACTCATAA TCACCACAAG AGCCAACAAA 360
CAGAAAGCAA TCCCATTAAC AATCATGTGA AGTAAGATCG ACATTTCCAA ACGTTGGGTC 420
TTGTAGGCTG TCCAAGATAG AACTGTCGAC ATACCTCCAT AAATCAATAA AGAAGGTAAA 480
TTACTTGGTT GATGTAATAA AGCAAACACA ATCGTACCGA CTACAAAT 528



(2) INFORMATION FOR SEQ ID NO: 141: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

CTATGAGAGA GAATGACCTT CTCTTGATTA CTGCGGACCA TGGAAATGAC CCAACGTATG 60
CAGGAACGGA TCACACTCGG GAATATATTC CATTGTTGGC CTATAGCCCT GCCTTTAAAG 120
GAAATGGTCT CATTCCAGTA GGACATTTTG CAGATATTTC AGCGACTGTT GCCGATAACT 180
TTGGTGTGGA AACTGCTATG ATTGGGGAAA GTTTCTTAGA TAAATTGGTA TAAGATGACG 240
CGCTATGCTT TGCTGGTGAG AGGTATCAAT GTTGGTGGTA AGAATAAGGT CGTCATGGCG 300
GAGCTTCGTC AAGAATTGAC AAACTTGGGA CTGGAAAAAG GTTGAGAGCT ACATCAATAG 360
TGGCAATATT TTCCTTTACT TCGATAGATT CCAAAGCCCC AATTGGTTGA AAAAGCTAGA 420
GACTTTCTTT GCAGTCCATT ATCCATTTAT TCCAGAACTT TTTCCTTTAC TGAGTCTAGA 480
GGACTTTGAA GGGCGGAACT TGAAA 505



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



CTATTTTCTC ATCTTTTTTA GACTGGTAAT GTATCCCCTA GAGTTTCATT GTTAAGACCT 60
GTCGTATTGA CAACAGTGGA TCCTGATTCC CCATTCCAAG AGTTTTGCCC GCATCATATN 120
GACGAAATCT TTTTCTGAGC CAGCAATTTT CTCAGCTAGG GCAATAGCGG CGCTGTTGGC 180
ACTAGATACC AGAGTTGCTT CAAGCAACTC TTCGACAGTA TAATTACGGG CCTCCATAGG 240
AATATTACTG GCTTCAGAAT TTGTCGTCAA TTGATAAGGA TAATCAGAAA TATCTACAGG 300
AGTGGAGAGG GTAATACTTC CGTTTTCCAA AGCTTCATAG ACCAGATAAA CAGTAATCAA 360
TTTTGTTATG GAAGCAATTT CGACAGGTGG CGTTGCATCC TTCTCATAGA GAATTTTACC 420
AGTATTTGCC TCAACAGCAA TCGCATGTTT AGCGGCAATG GTAAAATCTT GAGCAACAGC 480
AGTAGAAGCA CCCCCTAAAA GAGAGACAGT TAACAAAGTT AAAAATATTT TTTTCATAGT 540
AGTCTTATTC TATCATAAAG AAAAAAAATA TTCTTGCTTT AATAATTCAT CTGTTAAGCT 600
TTTTGAAAAT ATGGTAAAAT AAAGTGAGGG AGGTAACTCA TGTTTCGTAG AAATAAATTA 660
TTTTTTTGGA CCACAGAAAT TTTACTCTTA ACCATCATCT TTTACCTATG GAGACAGATG 720
GGGTCTTTGA TTAACCCTTT TGTTAGCGTG CTTAATACAA TTATGATTCC ATTTTTATTA 780
GGGGGCTTTC TTTATTATTT GACAAACCCT ATTGTTACTT TCTTAAATAA AGTCTGTAAA 840
CTCAATCGTT TGCTTGGTAT TTTAATTACC TTGTGT 876



(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
CTTGTCAAAT GCCATGGAAG GGGTTCTTTA CTTCCTCAAA CCAGACTTTT CAAAACTGAC 60
TAGCGCTGAT CTCCTCTATG CTCTGGGACA ATCTTTCTTT GCCCTCTCAC TAGGGGTTAC 120

AGACATGTTG ACCTATGCTT CTTACTTGGA CAAGAAAACC AATCTAGTCC AGTCAGGAAT 180 

CTCCATCGTA ACCATGAATA TCTCGATAGT CCATCATGGA AGGTCTAGCC ATTTTCCCAG 240 

CCATGTCAGC CTTCAATATC CACTCTGAAA GGGGACCCAG CCTGCTCTTT ATCGTCTTGC 300 

CTCAACTCTT TGACAAGATG CCTTTTGGAA CCATTTGCTA CGTCCTCTTC CTCTTTGCAA 360 

CTGTCACTTC TTCTGTCGGT GATGCTGGAG ATCAATGTGG GCAATGTCAC CAACCAGGAT 420 

AACAGCAAAC GTGCCAAATG GAGTGTTATT TTAGGAATTT TGACCTTTGT CTTTGGCATT 480 

CCTTCAGCCC TATCTTACGG TGTCATGGCG GATGTTCACA T 521 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

CTGCAGACAA TAACTTGGTT AGCTTAACGG CTCTTGAGGA TTCATCCAAG GATGTAACCT 60 

TTACCAGTTC GGCTTTCAAT CTAAAAGAAG GGCGACACCT TCAAAAAGGG GATTCCAAGA 120 

AAATCCTTAT CCACGAAGAA TTGGCTAAGA AGAACGGTCT TTCACTTCAT GACAAGATTG 180 

GCTTGGATGC TGGTCAGTCT GAATCTGGAA AAGGACAAAC AGTAGAGTTT GAGATTATCG 240 

GCATCTTTCC TGGTAAAAAA CAAGAGAAAT TCACAGGCTT GTCTTCTGAC TTCAGTGAAA 300 

ATCAAGTCTT TACAGATTAT GAAAGTAGCC AAACCCTTTT GGGCAATAGT GAAGCTCAAG 360 

TCAGTGCAGC ACGCTTCTAT GTTAGAAAAT CCTAAGGAAA TGGACGGACT CATGAAGCAG 420 

GTAGAAAACT TGGCCTTGGA AAATCAAGGC TACCAAGTCG AAAAGGAAAA CAAGGCTTTT 480 

GAACAAATCA AAGACTCAGT TGCAACTTTC CAAACCTTCC TGACCATCTT CCTTTATGGA 540 

TGTTGATAGC AGA 553 

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:

CTATGGATGC GGAAGTTTCT AAGAACCTTC GCTTGATCCT TGAGCGTAAA GGAATGACCA 60
TCTTGACTGG TACTAAACTG CAAGAAATCA TTGAGGAAAA TGGTCAACTT CGTATCAAGG 120
TTGAAGGAAA AGACAATATC ATCGCAAGCA AAGCTCTTCT TTCAATTGGT CGTATGCCAG 180
ACCTTGAAGG TATTGGAGAG GTTGAGTTTG AATTGGATCG TGGTTGTATC AAGGTCAATG 240
AATACATGGA AACTTCAGTT CCACGCATTT ATGCCACCAG GTGACATCAA CGGTACTAAG 300
ATGTTGGCTC ACGCAGCTTT CCGCATGGGT GAAGTTTCCG CTGAAAATGC CCTTAAAGGA 360
AATCATGCAG TTGCCAAATT GAATTTGACT CCTGCAGCCA TCTACACTCT CCCTGAAGTA 420
GCAGCAGTAG GTTTGACAGA AGAACAAGCC CGTGAGAAAT ACGATGTTGC CATCGGTAAG 480
TTTAACTTTG CTGCTAACGG TCGTGCTATT GCATCTGACG CAG 523

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 



CTGCAGGTGG AATTATGAAT GTCGGATATG AAAAAGCATT CTTGATGCAG ACATCGTTAA 60
ATTTGCCAAC TTCTGAAATT ATCTCGACAT ATGTCTATAA AGTTGGTCTT GTATCAGGAG 120
ACTATTCTTA CTCAACAGCG GTTGGTTTGT TTAATGCAGT GATTAACGTA GTATTGCTTG 180
TTGCAGTTAA CCAAATCGTT AAACGCATGA ATAATGGTGA AGGAATTTAA GGAGGAAAGT 240
ATGAAAAATT CGATTATGGA TACAAAATTT GATAGACGTA TCTTACTCTT AAATAAAATC 300
ATTATTGTCT TTATCGTTTT GATGACTTTG CTTCCTTTAC TTTATATCGT CGTAGCATCC 360
TTTATGGATC CTAAGGTTCT GGTTAGTAGA GGGATTAGCT TTAATCCAGC CGATTGGACT 420
GTAGAAGGTT ACCAGCGTGT TATTCAGTGA CCAATCTATT CTAAGAAGTT TTATCAATTC 480
CCTACTATAC TCTTTTGGAT TTGCAGCT 508



(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:
CTAAAAAGAA CCAGATGGAT CTTGTGATGA ATTTACATCA TTTTGATTTA CCAGTGGAAC 60

TTCTTCAAAA ATACGGTGGT TGGGAAAGCA AACATGTAGT GGAGTTATTC GTGAAGTTTG 120 

CCAAGACTGC TTTCAACATG CTTTGGAGAT AAGGTTCATT ACTGGACAAC TTTCAATGAG 180 

CCAATGGTCA TTCCAGAAGC AGGATACTTA TATGCTTTCC ATTATCCAAA TCTAAAAGGA 240

AAGGGAAAAG AGGCCGTACA AGTCATCTAT AATCTAAACC TTGCTAGTGC AAAAGTGATT 300 

CAACTATATC GCTCATTAGG ACTTGATGGA AAGATTGGGA TTATTTTAAA CTTGACACCT 360 

GCTTATCCAA GAAGTAATTC TCCAGAAGAC TTAGAAGCAA GTCGATTTAC AGATGACTTC 420 

TTTAACAAAG TCTTCCTTGA ATCCAGCTGT TAAAGGAACT TTCCCAGAAA AGATTGGTAA 480 

AAACAGCTAG AGAGAGATGG CGTGTTATGG AGTCATACCG AAAAAGAG 528 

(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

ACTCTTATTG GTGAGGTATC GCAACTTCTG TTTCTGCCTT CCTTGGTGGA CCAGCCAATA 60 

CAACTTACGG AGAAAATACA GGGGTTATCG GTATGACTCG TATCGCTTCT GTCTCAGTTA 120 

TCCGTAACGC TGCCTTCATC GCGATTGCCC TCAGCTTCCT TGGTAAATTC ACTGCCTTGA 180 

TTTCAACTAT TCCAAACGCT GTACTTGGTG GTATGTCAAT CCTTCTCTAT GGGGTTATCG 240 

CCAGCAATGG TTTGAAAGTC TTGATTAAAG AACGTGTGGA TTTCGCTCAA ATGCGAAACC 300 

TCATCATCGC AAGTGCTATG TTGGTTCTTG GACTTGGGAG GAGCTATCCT TAAACTTGGT 360 

CCAGTTACAC TTTCAGGTAC TGCCCTTTCA GCCATGACAG GAATCATCTT GAACTTGATC 420

TTGCCATACG AAAATAAAGA CTAAGAGTCT AAATACACCT AATCCACTCA GACAGCTGAG 480

TGGATTTTTC GTATACCATA ATAAAAGTGT CTTAACAAAA TTATTAAAAT CAAAAAACGT 540

ATAATATCAG ATATTCTAAA ACCTTGATAC TGTACGTTTT ATCATAGAAA TTTTTACTTT 600

ATTTTCTCAT CAAATGAGAT TTGCATCAAT CTCTTGTCTT ACTTGCGTTT CTTCTTCGCT 660

TTCTTCATTT TGTTAGCCAT ACGTTTCATG GACTGTTTCA TGGCAAATTC ACCAATTTTA 720

CCTTTCAAAC CGCCACCAAA CATCTGGCTC ATATCTGGCA TTCCTGCTCC TCCGAGAG 778

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 622 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

GGGAGTTGGT TGCTATTNGT GAGGGAAAAG GTTTGAACTT TGAAATCGAA GTGGATGGTG 60

GGATTGATGA CCAAACTATT GCTCAAGCCA AAGAAGCTGG TGCGACTGTT TTTGTAGCAG 120

GTTCCTATGT CTTTAAGGGA GAAGTCAATG AGCGAGTACA AACTCTCAGA AAACAACTGG 180

ACTAGGGTTG CAGTTTTTGC AGGCGGAAAC CGCGGTCATT ATCGGACAGA TTTTGATGCT 240

TTTGTTGGGG TGGATCGAGG CTCGCTCTGG GTCTTGGAAG AAGACTTACC TCTTGCTCTA 300

GCAGTCGGAG ATTTTGATTC TGTGACGGAA GAAGAGCGAC AGGTGATTCA AAAACGTGCC 360

CAGTATTTTG TCCAAGCCCG GCCAGAAAAA GATGATACTG ATTTGGAATT GGCTCTCTTA 420

ACCATCTTTG AACAAAATCC TCAGGCTGAG GTCACTATTT TCGGTGCCTT GGGTGGCCGT 480

ATTGACCATA TGTTGGCCAA TGTTCTTTCT ACCTAGCAAT CCCTAAGTTG GCACCCTATA 540

TGCGTTCNAA TAGAAATTGA GGAATGGGCA AATTTGATTG CCTATTGTTC CAGAAGGGAA 600

CAGTCAGATT CGAATCCCTG CA 622



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



CTATTATCAG CACTTTTATA CCTATTACCA AAAAAATCCT CAAAATGATT GGCAAAAGGC 60

TTTATCCACC AGCCATTTAT CAGCAATATT TACTTGAAAA ATATGGTAGA AATAGAAAGG 120

ATGGAGGAAT CTAATGGTAT TACAAAGAAA TGAAATAAAT GAAAAAGATA CATGGGATCT 180

ATCAACGATC TACCCAACTG ACCAGGCTTG GGAAGAAGCC TTAAAAGATT TAACAGAACA 240

ATTGGAGACA GTAGCCCAGT ATGAAGGCCA TCTCTTGGAT AGTGCGGATA ACCTACTAGT 300

AAATCACTGA ATTTTCTCTT GAAATGGAAC GCCAGATGGA GAAGCTTTAC GTTTATGCTC 360

ATATGAAGAA TGACCAGGAT ACACGTGTAA GCTAAGTATC AAGAGTACTA TGCCAAGGCC 420

ATGACACTCC TACAGCCAGT TAGACCAAGC CTTTTCATTC TATGATCCTG AATTTATGGA 480

TATTAGCGAA AAGCAGT 497



(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY : linear 
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<ii) MOLECULE TYPE: Genomic 0NA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151: 



CTGTTATGGA 


GTTGAAAGAA 


GAATACCACC 


CAGACGGTAA AGGTTTTGAA 


ATGATCTGTA 


60 


TCGGTGGGAT 


GGGAGCTGAT 


TTCTTTAAGG 


CTCGCGGTAT TCAACCACTT 


TATGAATTAC 


120 


GTGGCTTGTC 


AGACCAACCT 


AGCTTTGATC 


AAGTTCGTAA GATTATTTCA 


AAAACTGTTG 


180 


AAATGTACCA 


AAATGAACTC 


TTTGATGAGC 


TTTATGTTTG CTACAACCAC 


CATGTCAATA 


240 


CGCTAACCAG 


TCNAATGCGT 


GTGGAACAAA 


TGCTTCCGAT TGTTGACTTG 


GATCCAAATG 


300 


AAGCGGATGA 


AGANTACAGC 


TTGACTTTTG 


AAATTGGGAA ACCAGCCGAG 


AAGAAATTCT 


360 


GGAGCAGTTG 


TTGCCTCAGT 


TTGCAGAAAG 


TATGATTTAC GGTGCCATTA 


TCGATGCCAA 


420 


GACAGCTGAG 


AATGCTGCGG 


GTATGACAGC 


CATGCAAACA GCGACAGATA 


ATGCTAAGAA 


480 


AGTCATCAAT 


GATTTGACAA 


TTCAGTATAA 


CCGTGCCCAG ACAGGCGGAT 


ATTACACAAG 


540 


AAATTACCGA 


AATCGTAGCA 


GTGCCTAATG 


CCTTAGAATA GCTCTAGTCC 


CAGCTCTCCT 


600 


CACTC 










605 



(2) INFORMATION FOR SEQ ID NO: 152: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

CTACACTTGA GGGAAGTTTG AAAATTCCAT TTTTCTTAAA GGGAATTGTG AAACGCTATG 60
TATTTTCTTT TTACAACCGG ATGGAGCACT TGGTTGTGGT CAATCCTATG TTTATTGAGG 120
ATTTGGTAGC AGCTGGTATT CCACGTGAAA AAGTGACCTA TATTCCTAAC TTTGTCAACA 180
AGGGAAAAAT GGCATCCTCT ACCACAAGAA GAGGTAGTCA GACTGCGCAC AGATCTTGGT 240
CTTAGTGACA ATCAGTTTAT CGTAGTAGGT GCTGGGCAAG TTCAGAAACG TAAAGGGATT 300
GATGACTTTA TCCGTCTGGC TGAGGAATTG CCTCAGATTA CCTTTATCTG GGCTGGTGGC 360
TTCTCTTTTG GTGGTATGAC AGATGGTTAT GAACACTATA AGAAAATTAT GGAAAATCCC 420
CCTAAAAATT TGATTTTTCC AGGCATTGTA TCGCCAGAGC GGATGCGCGA AATTGTATGC 480
TCTAACGGAT CTTTTCTTGT TGCCTAATTA CAATGAG 517



(2) INFORMATION FOR SEQ ID NO: 153: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

TGTTCGCCTC CATATCCNAA ACTTGCCCTG CCACCATACC CAGACTACCT GAAGCAAGGG 60

ATAAGTTGGC AATCGAGCCC ACCTNGATCT GACTTGGCGA ATCTGCCTGC GCAATCAAGG 120

CATATGGGTC TAAGAATAAG GATCTCCAGC CAAAATGGCC ATAGCTTCAC CGAATTTCTN 180

GTGATTGGTT AACCGCCCTC TTCGATAATC GTCATCATCC ATAGCAGGAA GGTCATCGTG 240

AATCAAGCTC CCTGTATGAA TCATCTCCAA GGCAGTAGCT ACCTGCGCGN GAGCAGGTTT 300

GATGGTAACC TGCAAGGCTT CCAGAACTTC TAACAAGAGA AAAGGCCGAA TACGCTTGCC 360

ACCAGCATGA ATAGAATAGA GAACAGACTC CCGTAAACTA GAGGCAAACT GCTGGTCTCC 420

ATAAAATCTT CCAAAGCCGA CTCGACAAGA GCTAATTTTT CTTGCTTTTT CATTCAAAAT 480

CACTTTCTGT TCCGTCTTCT TGCATGACCT TGACCAAGGT CTTTTCAGCC TTGTCCAGCG 540

TAGCTTGGAG CTCTTTTGAC AAGACCATGC CCTTTTGAAA GGCAGTAATC GCATCTTCCA 600

GAGCAATTTC ACCATTTTCC AAACTTTGGA CAATGGTTTC CAGTTCTGCT AGATTTTCCT 660

CAAATTTCTT TTGTTTTGAC ATCTTTAACC TCTAATTCTA CTTGACCATC TCGCATCAAA 720

AGCGTTACTT GGTCTTTTTT CTTCAAACTC TCAACCGAAT CTACAACGGA TCTTCTTTTT 780

TGACAATAGC ATAACCAC 798



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



CCATGTTTGC GGCAGTGACA GCAGGTTATC AGGCTGCCCT AATGGTACCA ACAGAAATCC 60

TCGCAGAGCA ACACTTTGAG AGTTTACAGA ACCTTTTTCC CAATTTGAAA CTGGCTCTCT 120

TGACAGGTTC CTTGAAAGCT GCAGAAAAGA GAGAAGTCTT GGAGACCATT GCCAAGGGTG 180

AGGCTGATTT GATTATAGGA ACTCACGCTC TGATACAAGA TGGGGTGGAG TATGCTCGTC 240

TTGGTTTGAT TATTATCGAT GAGCAGCACC GTTTTGGTGT AGGGCAAAGG CGTATTTTAC 300

GGGAAAAAGG CGACAATCCA GATGTCCTCA TGATGACGGC GACTCCCATT CCACGGACGC 360

TTGCCATCAC AGCCTTTGGA GATATGGATG TTTCCATTAT CGACCAGATG CCAGCAGGTC 420

GGAAGCCCTA TTGTGACGCG CTGGATCAAA CATGAGCAAC TACCTCAGGT CTTGACTTGG 480

TTAGAGGGGG AAATTCAAAA AGGTTCCCAA GTCTATGTCA TCTCTCCTTT GATTGAAGAA 540

547



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:

CTCTTTTAAC CGTTTTAGCG GTGACACCGA GGTATTTTTT CAGGACCCAA GACTTGTCGG 60

GCAACCGAAA CTGGGAGTTC GTCATCTCCA ATATGCAGAC CAGCAGCATC AACCGCAAGA 120

CAAACATCCA ACCGATCATC GATTATCAAG GGGACCTGAT AGGCATCTGT TATTTCCTTG 180

ACTTGTTTTG CCAGTTGATA ATATTGATTG GTTGTGAGAT TTTTTTCTCG CAATTGGACT 240

ATGGTAACCC CTGAACGGCA GGCCGTCTCA ACTTTTGCAA GAAAGCTTTC CACGGAATCT 300

TGATAGCGAT TGGTTACCAG ATATAGTCTA AGCGCTTCTC TATTCATAAA CCTCTCCTTT 360

GATGGTATCT AGCCAATTTT CATCTCTTCT TAGGAGCGAA AGCTGATTGA GTACTTGGTA 420

ACGAAATTCT TCCAATCCCA TTCCTTGAAC AACTATTTTC TCAGCCAGCG ATATTGAGAT 480

AAGAGACTGC TAAGCAAGAA CTTCAAAACC AGTCTTTCCT TGGCTGAGAA AAACAG 536



(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:

CTGGAGTGAC GATTGAGAAA ATCTTTGCCC TAGAAAATTA TCGAAATCAG TTAGCTGCTT 60

TTCCGCAAAC TGTCTGGATT TCAGAGGATA TTTTGCTAGA TTTGGCGGAT TCTCAGACTC 120

CACAGGGAAT TGTTGCCGTG GTTCAAAAAG AAGAAGTAGG ACAAGCTGAT TTGAGTCAGG 180

GCAAGTTCTT GTTTTTGGAA GATGTGCAAG ATCCTGGTAA TGTAGGAACT ATCATTCGAA 240

CTGCGGATGC AGCAGGTTTT ACAGGAGTGA TTGTTTCAGA TAAGTCGGCA GACATCTACA 300

GTCTTCAAAA CCCTACGTTC CATGCAAGGT AGTCATTTCC ATCTGCCCAT TTACCGGATG 360

ACTAGTCAAG CGCTTCTTGA CGAAACTAAA AAGGTAGCTA TCCCAGTGCT AGCAACAACC 420

CAATCTAAAG ATTCTGTTGA TTACAGAGAA CTGCCTTCTA TAGAAAATTT TGTACTAGTT 480

ATGGGAAATG AGGGTCAAGG AATTAGTCCC CTTATGGCTG AAAGTGCAGA CCAGTTGGTC 540

CATATTAGCA TGAAGGGGCA GGCCGAGAGT TTGAATGTTG CGGTTGCAGC CGGTATTTTA 600

ATCTTCCATT TAAGCTAATT TTAACTTTTT TTGTTATAAT CAAGGAAAGA TGTTCACAGA 660

AAAGGAGAAA TGGATGAATC ACACTATTAT ACATGACCGT GCAGGTCTCA ATCAATTTTA 720

CGCTAAGGTT TATGCCTTTG TTGGTCTGGG AATCGGACTA TCTGCTTTGG TATCAGGCCT 780

TATGTTGACG GTCTTTCAGT CTCAGTTGGT TTACTTTTTG ATGCAGGGGC GTCTCTGGTT 840

GACCATTGCT ACTT 854

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



CTGAAGAAAT CCTAAAAGAT TTGTATTACG GAGCCGCTAA GAAAATTCAA GAAGAAGCCT 60

CTCGCTGGGC AGGAGTTGTA AGAAATGACT AAAACAGCCT TTTTATTTGC TGGTCAAGGT 120

GCCCAGTATC TACGGATGGG ACGGGATTTC TATGATCAGT ATCCGATTGT CAAAGAAACG 180

ATTGATCGAG CGAGTCAGGT GCTCGGTTAT GATTTACGTT ATCTCATCGA TACGGAAGAA 240

GACAAACTCA ATCAGACCCG CTATACGCAA CCAGCCATTC TAGCGACTTC GGTTGCTATC 300

TACCGTTTAT TGCAAGAAAA GGGCTATCAC CCTGATATGG TTGCTGGTTT GTCTCTTGGA 360

GAATACTCTG CCTTGGTGGC AAGCGGCGCC TTGGATTTTG AAGATGCCGT TGCCTTGGTA 420

GCTAAGCGTG GAGCCTATAT GGAAGAAGCG GCTCCTGCTG ACTCTGGCAA GATGGTAGCA 480

GTTCTCCATA CCCCGTAGAG TCCTTGAGAA GCCTGTCCAA AGCTCTGACT TGAGTGGTAC 540

TCCAGCCACT ATACACACTG CACAATCTCA TTGCTGGAAA ATTGTTGCAG TGATCAACGG 600

TGACTTTGCA GAGCAGTGCC AACCCTATCC CCTAGT 636



(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

CTCTAAAACG AATTCTCTAC AAGCACCGCA AGGGCATGGC TGAACTTCCA CCATAAGGTG 60


UTTTGTC I C 


AAAUdCTAAT 


ACTTCNTTAA 


CCTTAGTTTG 


TCCTGAAAAT 


TGGTACATAT 


120 


1 OAAGACjVjVjC 


CkCCCGTTCT 


GCGCAGAGAT 


GGAAAACACC 


ACAGGTTCCC 


TCCATACAGA 


180 


t\ TV^PTV^Tl 7\ TV TV 

A I v_C Ivj 1 AAA 


1 Al 1 rCl CCA 


TCTCCTGCTT 


CTACTGCAGC 


TACAACATGA 


TTGGCATAAA 


240 






CrGATTGTATA 


GTTTCTGTGC 


TTCTTCCTAC 


ATCTTTTCCC 


300 


AvjA 1 VJ i CO\ 1 


1A1 1\j1AILL 


Hipm TV fTt TV m/TifTt TV 

I C I Al ATTTA 


GAGATTTCTT 


TTAGAATGTT 


TTCGATATGC 


360 


1 VjAA 1 1 un 1 1 


1 A lCAColCC 


IV TV TV 7V /"» TV TV TV 

AAbCAAGAAA 


ATTGTATC TG 


GTAATTCTGG 


CCCATGCATT 


420 


TCGCCTGAAA CTGCGATACG AATAGGCATG AAAAGATTTT TCCCTTTAAT ACCTGTTTCT 480

TTTTGGACTG CTTTAATTTG CGGGAAGATA TTTTCTGTCA CAAATTCATC ATCTGTCATC 540

CGCTTCAATT TTTGCTTTGA ATGCTTCAAG AACTGTTGGA ACTGTTTCAC CCGTCATGAC 600

TTCGCGCTCT GCTTCTGTCA ATTCCTGGGA AATCTGAGAA GAAAGATCTG TCCATGGGAT 660

ATCTCATCTA CTGATTCATT GTGGTTTATA GAG 693



(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CTGACAAAGG AGACGGGTAT GGAACAAACA TTCTTTATCA TCAAACCAGA TGGTGTAAAA 60 

AGAGGGCTAG TGGGTGAAGT GTTAAAGCGC ATCGAACAAC GTGGATTTAC AATCGAAAAA 120 

TTGGAGTTTC GTTCACAGGT TTTCAGAAGA GTAGATTGAC CAGCACTATC AGGACCTGGT 180 

TGGTCAGAGT TTTTACCCAC CGATTCGTGA ATTCATGACT TCAGGTCCAG TTCTTGTGGG 240

TGTCATTTCT GGTCCCAAAG TAATCGAAAC TTGGCGGACC ATGATGGGTG CAACTCGTCC 300

AGAAGAAGCT TTACCAGGCA CTATTCGAGG TGATTTTGCA AAAGCTGCTG GAGAAAATGA 360

GATTATCCAA AATGTTGTAC ATGGTTCAGA TTCCGAGAAG AATCAGCTAA GCCGAGAAAT 420 

TGCTCCTTTG GTTTTAAGAG TGGATTGGCT CAATCAATTG GATAAAAGCT CATTTGAATA 480 

GAAAGTATAG TCAATTAGTT TAAGACATGA CGCATGATAT CAAACTTTTT AGTTTTTGAT 540 

ATGGTGCGTT TTT 553 

(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:

CTAGAGATTT NCGAAGAGTA TTATTGTACT TTAAAGGTCT TGAGATAATT GTCTTTNCCT 60

ACTTGACCTT CGAAGGTTTT ACCATTTTCA AGGTAAGGAA GGTCATCGGA TACTGAAGCC 120

TTGACCTTGT ATATCTTGCC ATCAACTTTA AAGAAGTAGA CAGTGTCTCC CTTGATAACA 180

GCTGATTTGA GGTCTGCTAC TACTCCCTTG ATGCTTTCTG TCGTTGCATT GTCAATTTCA 240

AGGTCGTTTT TATTGGCATA CTTGCTGAGC ATCTCTTCCA CTGTAGTAGC AACGATAACA 300

TTTTGGTACT CGACTGCGTC TACCAGGGCG TACTCTTTGA CCAAGCCAGC ATTGTCCTTT 360

AAGCCCATGA TGTAAAGAGG CTTGTCATTG AGGTTGATAA AGATTGGGGA AAGGTTGCTT 420

TGTAGGATTT CTCCTGAACA GCACCTTCTG CTGATTCACG GGCTGATTCT TCTGTCGCAG 480

AAGCCAAG 488



(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:161: 



TACAGCTATC GGGTCCGTCT GGTAGTTGGT CCTCAGAACC TTCACTGCCA CTTCTTCCCC 60

ATCTAAGATT AAGTCTTTGG CTAGGTAGAC ATCCGCCATG CCTCCTCGAC CAATCTGTTT 120

GACAATCCGA TAGCGTCCGG CAAAAATCTT GCCGATTTGG ATCATTCTTC ATCCTCCTCG 180

TTCATAGAAA CAAGGGCAAC CGTAATGTTG TCTAAACCTC CTGCATTGTT AGCAAAACGA 240

ACAAGTGTCT CCGTTTTATC TGCTAAAGGA ATATCACTGG TTACAATATC ACGAATCTCA 300

CTGCCTGAAA TCATGTTGGT CAAGCCGTCA CTATTGAGCA AGAGATAGTC ACCTGACTCA 360

AGGATAACTG TCCCAAAATC AGGCTGAATT TCATCTTTTT GCCCAATAGA CTGGGTGATA 420

ATATTTTTTT GTGGATGACT TCTGCCTCTT CTGGTGTCAA TTGACCAGCC TTGAGCAATT 480

CATTAACCAA AGAATGATCG CTCGTCAACT GATGGTATCT TCTCCACGAA TCAAGCCGAT 540

ACG 543



(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:

CTTTAAGGAA AATCAAATCT CTCATGCTGA TACCTCTCCT CATTAAATTA AATAGTAAAA 60

AAGATTCTAT CTCACTCCCT GATTATTACA AAACCATTGA AATATCACAA CTAATAGGCT 120

AGAATGGACA TAGTAAGATN TAGTAGATGA GTCATTCTAC TCAAATCCAC GTTAGAAAGG 180

ACTGCTATGC CAGACAATCT CGCGCTTCGC ATGCGCCCTA AAACCATCGA CCAGGTCATC 240

GGTCAGGAGC NTCTGGTCGG ACCTGGAAAA ATCATCCGCC GCATGGTGGA AGCCAACCGC 300

CTGTCCTCCA TGATTCTATA TGGCCCTCCT GGAATCGGCA AAACCAGTAT TGCCTCTGCC 360

ATCGCTGGAA CGACCAAGTA TGCCTTTCGA ACTTTCAATG CGACAGTTGA TAGTTAAAAA 420

GCGACTGCAA GAAATCTCGG AAGAAGCTAA ATTTNCTGGT GGTCTCGTCC TATTGCTAGA 480

CGANATTCAT CGACTAGATA GACC 504

(2) INFORMATION FOR SEQ ID NO: 163:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:

CTAATAGAGG CGAAAAAATT TCAGGTCCTC CTTTGCTAGA TGATAATGAG GAAACTAAGA 60

TTTTACCAAC CTCTTCTTCC CGTTTTGGTT ATGCCAATCC TAAGGATCAT GGTTTTAGCC 120

AGGAAACCTT GAAGATTCAG GTCGAACCAT CTATTCATAA AAGCCGTCGT ATTGAAAATA 180

CCAAGAGAAA TGTCTTCAAT TCTAAGTTGA ATAAAATCTT ATTTGCGGTC ATCTTTCTCT 240

TGATTTTGCT TGTTTTAGCA ATGAAACTTT TGTAATAGAA AAGGAATTGA AATGAAAATA 300

GGAATTATTG CTGCTATGCC AGAAGAACTG GCTTATCTGG TCCAGCATTT AGATAATOCC 360

CAGGAGCAAG TTGTTTTGGG GAATACCTAT CATACAGGAA ACCATTGCTT CTCATGAAAG 420

TCGTTCTTGT TAAAAAGTGG AATTGGTAAG GTCATGTCTG CTATGAATTG TGTGGCGAAT 480

TTTGGCTGAT CATTTCCAGG TTGGATGCCC TTATTAATAC GGGTTCAG 528



(2) INFORMATION FOR SEQ ID NO: 164: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

CTAATAAGAC TGAAAAGAAG TCCAGCCAGA TGTCCCTAAA AATACAGAAA AAACATTAAA 60

ACCAAAGGAA ATCAAATTTA ATTCTTGGGA AGAATTGTTA AAATGGGAAC CAGGTGCTCG 120

TGAAGATGAT GCTATTAACC GCGGATCTGT TGTCCTCGCT TCACGTCGGA CGGTCATTTA 180

GTCAATGAAA AAGCTAGCAA GGAAGCAAAA GTTCAAGCCT TATCAAACAC CAATTCTAAA 240

GCAAAAGACC ATGCTTCTGT TGGTGGAGAA GAGTTCAAGG CCTATGCTTT TGACTATTGG 300

CAATATCTAG ATTCAATGGT CTTCTGGGAA GGTCTCGTAC CAACTCCTGA CGTTATTGAT 360

GCAGGTCACG TAACGGGGTT CCTGTATACG GTACACTCTT CTTCAACTGG TCTAATAGTA 420

TTGCAGATCA AGAAAGATTT GCTGAAGCTT TGAAGCAAGA CGCAGATGGT AG 472



(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

CTTTTAAATA AGAAGAATCA CACTCATTGC TAAATGTGAT ATAATATTGT TCTCAGTCTC 60

AGCAGTGCGT TGTGAAAAAG CCATACTAAT CCCTGATTTT TCCGCATATT CCATCACCTC 120

AAGTGTTTTA CTCACTGAGC CAACTTGATT AGGCTTTATT ACAATACCAT CAGCACAATC 180

TTTAAACTGT GAAATTCTTT CTAAGTTTGT TGCGTAAAAA TCATCACCAA AAACTTGTAA 240

CTTTAGAGGT TTTATCAATT GAAATTTTCT CCAACTATCC AAATCTTCAT CGGAAAATGG 300

ATCTTCAAGA TAAACCAATG GATACTTAAC TCCCCAATCA CAATAAGTAT CCATAATTTC 360

TGTGACTGTC TGTTGCTGAC TAACACACCA TGGTACTTTA TATATGCCAA GTGAAGAATC 420

GTAACGATCT GTCATTGCTA AATCTAAACC GATATCATAT CTATTTTTGT ATAATTTTAA 480

TGTCTCTAAT AACGAATCCA AAATAATATA AAAATCATCC GTGTTAATTA TTAATGCACC 540

TTGATAGAAA TAGATGTTGT AGAAACCTGG GTTTTAACTT TATCATAACT CATTTTTATG 600

TTAGA 605



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

CTGAAGAAAA TCAGTAAACA AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT 60

TTTGGACAGG AAAAAGACGG TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG 120

CAGGGACAAA CACTATCGGG CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT 180

AACTTGTATG CTGCGACAAA TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT 240

GGTCTAGCAC CTTATTTCAA TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG 300

GATGCTCTTT TTTATGAAAA GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACC 360

TGATGATTGG AGATTCTCTA ACCGCCGACA TTCAAGGTGG CAATAATGCG GGGATTGACA 420

CTATCTGGTA TAATCCTCAT CACCTCGAAA ATCACACACA AGCCCAGCCG ACTTACGAAG 480

TCTATTCTTA CCAAGACTTG CTGGATTGTT TAGATAAAAA TATTCTTGAA AGATCACGTT 540

TTTAAGGAGA TAG 552

(2) INFORMATION FOR SEQ ID NO: 167:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 405 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:



GCATAGAAAA 


ACAGTGATTG 


60 


TATTAAAGAG 


TTGAAAACGA 


120 


AAGCTACCAA 


GAGCAAGATT 


180 


TTTTGAGGAG 


AAATTAGCAG 


240 


TACTGTTCAC 


GATAGGGTGG 


300 


GCCCTCAAAT 


TGGCAGGGAA 


360 


TACAC 




405 



(2) INFORMATION FOR SEQ ID NO: 168: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:

CTCAAAAACA GCTTCTAACA GCTTGAACAA GGGGGCTAAA ATTGATTCCT TGATGTAGGG 60

TTTGAAGTAA GATAATAGGT GTTTCATAAA TCCCTTCTAT TCATATTCTA GAAATGAAGA 120

AAGTGGGAAG CCCCACTCTC TGTTTTATTT GTTTAAGTAA GGCAATAGAT AGCCATATCC 180

TGCTTTTTCC ATCTCATCCT TGGCCACAAA GCGTAAAGAA GCAGAATTGA TACAGTAACG 240

GAGGCCGCCT AACTCCCGCG GTCCATCTGT GAAAACATGA CCCAAGTGAG CACTGCCTGA 300

ACGAGAACGA ACTTCAATTC GCTCCATTCC ATGGCTCAGA TCCTTGTAAT AATGAATCAA 360

CTCTTTGGAA ATCGGACGGC TTAAAACTTG GCCAACCCAC AACCTGAAGC AAACTTATCC 420

TTGGCAAAAA AGAGTTGGCT CACCTGTTCG TAATATCTAC ATAAATCCCC TCCTTCCAAA 480

GGTTTTGGTC CATAGGCATT GGTAAATGGA CCTCCTGTTA CCAG 524



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 



CTTCATCTGG TGGTTCTCCA GTTGTAAGTA ACGTAGTACC ATTCTCAAAA CCATACTCAC 60

AAGTTGGTAT TAAGGGGGAA CCATATATCT TTAAAGGAAT GAAATTGCAA AAAGATATTG 120

TTACAACAAA AGAATATAAC GAGGTTTTAA AAAATGGCAA AAAGAAAAAT TGGAATCCAA 180

TAGCAAATAC CAAAAAGAAC TAGAAAAATA CATTAAATAA GGAATGGTAT TGATCTTGAT 240

AAAATTTTTA AAATACTGTC ATTTTGAATA TAAAGGAGTT TGATATGGAG TGGATTAGAT 300

TAATAGGAAT AGCAATCATT GTTGTGGGTT TTATTTTAAA ATTTGATACA ATTGCAACAG 360

TAGTCTTAGC TGGTTTGGTT ACAGCTTTAG TTTCAGGTGT TTCTCTCGTT GAATTTTTGG 420

AGATTTTGGG AAAAGAATTT AGCAATCAGC GAGTGCTCAC GATTTTTATG GTTACCTTGC 480

CTCTTGTGGG GCTGTCAGAA ACCTTTGGAC TCAACAACGA TCAATCGATT TGATTCGAAA 540

GATTAAAGGT CTGACAGTTG GAAACTTCTA TACAGTTTAT TCTTTATCGA GAG 593



(2) INFORMATION FOR SEQ ID NO: 170: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

CCTGCTCCTT ATCTTTGCAG TAATTGGCGT TTTAGTGGCC TTGATAGCTC AATTTTACTC 60

AGCAAAGGCA GCAGTAGGTT TTGCTAAGGA ATTGACAAAC GATCTTTATC GTCATATTCT 120

TTCCTTGCCC AAGGACAGCA GAGACCGTCT GACAACTTCT AGTTTGGTTA CTCGCTTGAC 180

TTCGGATACC TACCAGATTC AGACTGGTAT CAATCAATTC CTGCGTCTCT TTTTACGAGC 240

GCCCATTATC GTTTTTGGTG CCATTTTTAT GGCTTATCGA ATCTCAGCTG AGTTGACTTT 300

CTGGTTCTTA GTCATGGTTG CCATTTTGAC CATTGTCATT GTTAGGGTTA TCTCGATTGG 360

TCAATCCTCT CTACAGTAGT CTCAGAAAGA AAACCGAACC ACTGGGTTCA NGAAACCCCC 420

CNCCATTGCA AGGATGCCGG GTTATTCCGT GCCTTT 456



(2) INFORMATION FOR SEQ ID NO: 171: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



CTTGTACAGT TGAATCGTTC AGATATTGAT TTTGAAAACA GAGAGTGTGT TGTCTTTGGT 60

AAAGGAAAGA AGGAGAGACC AGTATATTTT GACGCTCGGA CGAAAATTCA TTTAAGAAAT 120 

TATCTTAACG ACAGAAAAAG ATAGTCACCC TGCTCTTTTT GTAACGCTAG TTGGAAAAGC 180 

CCAGAGACTT GGAATTGCTG GTGTAGAGAT TCGCTTAAGA AAGTTAGGAG ACAAACTCGG 240 

CATACAAAAG GTTCACCCAC ATAAGTTCAG AAGAACTTTA GCGACTAAGG CAATTGATAA 300 

AGGTATGCCT ATCGAACAAG TCCAAAAACT GCTAGGTCAT AGCAAGATTG ACACAACCCT 360 

GGCCTATGCC ATGGTCAATC AAAATAATGT CAAGCATTCA CACCAAAAAT TCATCTCTTA 420 

AAAGCAAATC CCGATATTCC GAAGAGGTGG TTGCATGAAG AAAGTGAAAT TAGGTGAAGT 480 

GGCTACTTTT ATCAATGGCT ATGCTTTTAA ACCTCAAGAT TGGTCCTCTG AAGGAAAGAG 540

ATTATCCGAA TTACCGAATC TGACT 565

(2) INFORMATION FOR SEQ ID NO: 172:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

CTTTATACTT TATCATTTCT AACTTAATTA TAGTCTTATT TTTAATAAAG TTCAATTATT 60

TATATGTAAA ATTCCTCATC AGTAAAAAAT AACCGATCTC ATTTCTGAGA ATCGGTTTTC 120

TAAATAAATC AAACCGATCA TTTACATAAC ATAAATTATG TAAATGATCA TACTACAACA 180

ACAAATCTTT GACTTTTCCA ATTTCACTTT TTGGAATAAC CAGGTGAATC ATATCACCCA 240

GATACATTCT GGTTGAGCCG TTAACTGTTT GGCTCTTGCC ATTATGGACT TGAGTTGTGA 300

TGAGGACGTT GTGTGGTAAG TTGAGTTCAT GAACTTGTTT CCCAGCAATT TTATCAGAAA 360

CTGGTATTTC GATAAGTGTA ACTTCTCCTT CGCTAGATAC TTCTTCTGGG AAGCATTTTT 420

TCCAGCATGG CTTCATAGAC TGGCGTTCCT TTGAGCNAAT CCCTGATNAT TTTAGAAACC 480

AGAGTGACAG ACCAGTGGCT TAAGGTGCGA ATATCTCCTA CCATCTCAGT TACGAGAT 538



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



CGTGCTAACC ATGAGAGCAT ATATTTCCTC CAACTATTTC TATTTCAAAA AATCCTTACT 60

ATATTGTATC ACAATCAGAC ACAAAAAGAA AAAGCAAATG ATAAACAAAT GCTTTTAANG 120

TTTTAAAAAA AGCTTCGAAA GGTTCTTCTT TATTTTTTAA NGGGAGAGAT AACGTTGATA 180

TCTAAATCGT GGTCAAAGCC GGCAATTTTC CTTTAGATGT GTATTGGTGA ATATCATAAT 240

CTAAATCAGT TTTAGGACTG CTCTCCAAAA ATCCTGAGTC TGAGCCGTAG GACGGAATCC 300

AAACAGAGGT AAACTTGCCT GTATCAATAC TGTGTTCTTC CATGAAGTAG ACACCAACGT 360

AGATGCCGAT GTTTTTAGCA CCTAGTGATG CTAGTTTTGC TCGAAAGTTT TCGACACCTT 420

CGTTCATATA GACATAGTTT TGTCTCCACG TCAGCCCATA GTAACTAGGC TGTTAGGAGA 480

GCACATTGTA GAAAACTTCG GCAGCCTTT 509



(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:






*— X \J X X \ — X \J JL 1 


TGCGACGCTA 


CTTGGTTGTA 


GCATGGGGGT 


GGTTTTAGAT 


GGTCAGTAAG 


60 


TATCTTTTAT TAGCAGTTAT TTTCTCTGGC TTGGTGACTT GGATTCTCCG TATGATTTCC 120

TTCATCTTAG TCAAGTATAA GGGCTTGCCT GCAATCGTTG AGCGTTTTTT GAAGTTCTTG 180

CCCGTTTCCA TTATCTTTGC CTTGATTCTT TCAAGCGTAG TGACAGGTAA GGTTGGGAGC 240

CTTCCTCAAA TTAAATGGCT AGACTTCTTA GCCGTCTTTC CAACAGCTTG GGTAGCCTTT 300

CGCTACCGCA ATCTACTCGG AACAGTTCTC TTTGGAGTGG TCTTGATTGC CATCTTGCGT 360

TTGGTCTCTT AAATTACCCA CCAAAAAAAC TTATCACAGA GATAGATATC ATATAATGGC 420

GTAAATGCTC CTTTTCTGTT AAGATTATAA GGTATTCTAT TTTGGAGGAA ATGACATGAA 480

AAAAATCGTT AAATACTCAT CTCTTGCTGC CCTAGGACTT GTTGCTGCAG TGTGCTTGCG 540

GCTTGCTCAG GGGTGCTCAG AAAGAAGAGA AC 572



(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

CTAGGTTATT TAGTAAAAGA CTAGAAGATT TTCTTTTCCC AATCGTCTTC TGTACGGCGA 60 

GGATAGAAAA ACTCTGACTT GTCGGGAGCT TCCATCATTT CCATCAATGG TAGCATATCA 120 

TAGGCCAGAT TTAAGTTTGG AATCTGGTCT TTTTGCACCC AAGAAACTTC TCCCTCTTCT 180 

GAAGATTGAA GGGTACCAGA GAACTCAGTC GCCTTATAAC AAATGACAAT ATAGCGCCCA 240 

CCTGTATCTA GTGGCCAATT TTTAATGCCG ACAAGTTGAG GATTTTGGAT AGTCAACCCT 300 

GTTTCTTCGT AGATTTCACG AATGACAGAC TCCGCAAAAG CCTCATCATT TTCTACATGA 360 

CCTCCAGGAA AGGCATAACC AGACCAGCGA TTGTTTTCAG GGGCGCGATA CTGCATCACC 420 

ACGCGCTGAG TTTCGANGTC TTCCATCAGA CAGATATTTG TTTAAATTGT TTAATTGGGA 480

ACGGGACATA AATTTAC 497

(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

CTGAAATTTT GGAAGAAGAA ATCGGTGTAG ATACAGTCCA AAAAATAGGA CGCATCTTGA 60

TTTTGTTTAA ACAATCTAGC AAGAAAGAAA ATCGCAAGAT TTCTAAGAAA GTCAAAGAAA 120

TCTAAGATCG AAACTCCAAA TAACTGTTTT TATAGAGAAA TAAAGGGGAC TAGCCTATGA 180

CAATCGAACT ATTGACTCCC TTTACCAAGG TAGAGTTGGA GCCAGAAATC AAGGAGAAAA 240

AACGCAAACA AGTTGGGATT TTAGGGGGGA ATTTTAACCC TGTTCACAAT GCCCATCTCA 300

TTGTTGCGGA TCAAGTACGG CAACAGTTGG GACTGGATCA AGTTCTGCTC ATGCCTGAAT 360

ACCAACCTCC TCACGTTTGA TAAAAAGGAA ACCATCCCTG AACACCATCG TCTCAAGATG 420

CTTGA 425



(2) INFORMATION FOR SEQ ID NO:177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



CTGCTATCAA GACAGTATTA CCGCTGAAAG AAACAGCTTC ACTAGAAGGC GCAATTGAAA 60

AATAAGACGG GATCACGAAC TCCCGTTTTT CTATAAAAGA AAGGAAATGG GATGAAAAAA 120

TTAGTCTTTG TCTGTCTGGG AAATATTTGC CGTAGCCCTA TGGCCGAGTT TGTTATGAAA 180

TCAATGACAG ATAACTACGA AATCCAAAGT CGAGCAACTT CCTCTTGGGA ACATGGCAAT 240

CCGATTCATA AGGGGACTCA GGGAATTTTT CAAGAGTATG AGATTCCTTA TGACAAGAAC 300

AAGACATCGC TTCAGATTAG TAAGGAAGAT TTTGAAGCCT TTGATTATAT TATCGGAATG 360

GACGCTTCAA ATGTTCCGAC TTACGTCAGA TGTGTCCAGT AGACTGTCAA GATAAGAT 418



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



ACTGTTTGAA AAATATGATT CGTGGTCGAG AAATGAATTG CATTTAAGCA ATGTAGTTCA 60

GTATATAGAT TTGGAAATTA ATGATTTAAC AAAATAAAGG AGAAAAAACA TGGTTAAATA 120

CGGTGTTGTT GGAGCAGGGT ATTTTGGAGC TGAATTGGCT CGCTATATGC AAAAGAATGA 180

TGGAGCAGAN ATTACTCTTC TCTATGATCC AGATAATGCA GAGGCGATTG CAGAAGAATT 240

GGGAGCAAAA GTTAGCAAGT TCCTTAGATG AGTTGGTTTC TAGCGATGAA GTTAGATTGT 300

GTTATCGTCG CAACTCCCAA ATAATCTTCA TAAGGAACCG GTTATTAAGG CTGCACAGCA 360

TGGTAAAAAT GTTTTCCTGT TGAAAAA 387

(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

CTTATGAAAT CAATCATCAA AATAATATTG ATCAGGACTA TTTAGGTAAA TTATCTACAA 60
CGATTAAATT GGTAGCAGAA AAGGAAAATG CCGTTGAGAT CCTAGAACAC TTGAATGTTG 120
TCCCTGTGTT GACAGCCCAT CCAACACAAG TGCAACGCAA AAGTATGTTG GATTTAACAA 180
ATCATATTCA TAGTCTTTTG CGTAAATACC GTGATGTTAA GTTGGGGTTA ATCAATAAAG 240
ATAAATGGCA CAATGATTTG CGTCGTTACA TCGAAATTAT CATGCAGACA GACATGATTC 300
GTGAGAAAAA ATTAAAAGTG ACTAACGAAA TCACGAATGC TATGGAAATA TTACAACAGC 360
TCCTTTTTGA AAGCTGTCCC TCATTTGACG ACGGAGTATA AGCGCTTAGC GCAAGCGCAT 420
GGTCTGAATT TAAAACAGGC TAAACCAATC ACCATGGGTA TGTGGATAGG TGGTGACCGT 480
GAAGGAAATC CATTTGTTAC AGCAAAGAAC TTGAAGCAGT CTGCACTCAC TCAGTGTGAA 540
GTCNTCCTGA ACTTACTATG A 561

(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180:







TGCATTTGCA TCAGCAGATT TGTTTGAGTA TAAATTACAA 60

TTAGCAGGAC AGACTTGGGG ATATTTAGAA TTTGAAACAA ATACAGAAAA ATATGGGAAA 120

GTATTGTTAA TTATAAAGGG TAAGAAGCGA CTTACGAACC AATTTCCTTT GGTACAAAAA 180

AATAAGAGTG GCTACTTATT TGAATATGCT CAGATGAATA CACTTTATCT TAATCAACAT 240

TCTTCCTACA AAAATGATGA AGATAGTCAT TCCTTTCCAA TTCAGATGGA GTTAGTTTCT 300

GATGAAATGA TTCCAAGAAA TTGAACAAGC TACTAAAAAT TCCGAATATC CGAAAAATTT 360

ATGATTTTAA CTTATGANGC GGACTCCCGA AAAACAATAT TATATCTGTT AGATGTTTGT 420

TATGCCCTGA TGCCCGAACT GGTCCAGTTA CACTTGATTC CAGATTGTCC TGAGTATATC 480

CATCCAGTTC CGTACC 496



(2) INFORMATION FOR SEQ ID NO: 181: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:



CCACCCTCTA GCAAGTGGTG CTAATTCCTA TAGCCTAGAA GAAAACGAAA TCATCTACCA 60

AAAGTTAAAA AACTTGCTTA AAAACTGATA AAATACTTGC CAAACTTTTC AGAATCTGAT 120

AGACTAGTAT GGTAACAATC TATGGCTCGC AAAGAGACCA TGGCAGAAAG GAAATATTGC 180

AAAATGAAAA AAGATATCCA TCCAGAATAT CGCCCAGTTG TCTTCATGGA CACAACTACT 240

GGTTACCAAT TCCTTAGCGG TTCAACAAAA CGCTCTAACG AAACAGTTGA GTTCGAAGGC 300

GAAACTTACC CATTGATCCG TGTGGAAATT TCATCAGACT CACACCCATT CTACACTGGA 360

CGTCAAAAGT TCACTCAAGC AGATGGACGC GTGGATCGTT TCAACAAAAA ATACGGTCTC 420

AAATAATGAT AAGAGAACAG TTTCGGCTGT TCTTTTTTGT TTCTTGAAAT CAACTGCTGT 480

TTTCATGTTC CAGACTCATC TGTA 504



(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:

CTCAGGCGAT TACTAATTAC ATGACTTCTG CCTCAAACTT TAATGTCGAT GAGGCTAGCC 60

AATTCATTCA ACAATTTACA ATTACAAAAC AAATCGAACA AGTAGAAAAA CTATTAGAGG 120

AGTAGCATGG AAACTGCATT AATTAGTGTG ATTGTGCCAG TCTATAATGT GGCGCAGTAC 180

CTAGAAAAAT CGATAGCTTC CATTCAGAAG CAGACCTATC AAAATCTGGA AATTATTCTT 240

GTTGATGATG GTGCAACAGA TGAAAGTGGT CGCTTGTGTG ATTCAATCGC TGAACAAGAT 300

GACAGGGTGT CAGTGCTTCA TAAAAAGAAC GAAGGATTGT CGCAAGCACG AAATGATGGG 360

ATGAAGCCAG GCTCACGGGG ATTATCTGAT TTTTATTGAC TCAGATGATT ATATCCATCC 420

AGAAATGATT CAGAGCTTAT ATGAGCAATT AGTTCAAGAA GATGCCGATG 470

(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

CTTATACTTG CTTTTTTCCT TTTGAAAAAT GTTAATAAAT CGTTGGTAGT CTTCTCTAGG 60 
CATGGACAGA TCAATATCGT CGTCCCAAGG GATAAAGCCC TCATGTCGAA CCGCCCCAAT 120 
CAGAGTACCG TAGTTAATAA TATAGTTGAT ATTGTGCTTT TTACAGAGAG TATCAATATA 180 
ATCCAAAATT TCTAATTCAA TTTGTTTGGC ATCTTCAATG GTTAGTTGTT TCATTTTAAA 240 
CTCCTATGAT TTTTTGAATT TATTTTTTAA GGCTAGGACA TGGTTTAAAA ATTCATAGAA 300 
AATGCTATCT TTTGTGAAGA CAAGTAGACT AATATAAGAG ATAGCTGATA ACAAGACAAT 360
CAAACCAGTA TTAATCAAAA ATGGCAAATT AATGACCATA TCCACAGGAT ACACGAAATT 420
AATCAGGAAA TACATTGCTA CAAAGGAAAG TGAAAAGAGA GAGTATCGAA CAGTATAGCT 480
AAAGATATGT CCCAAGTGGA TGAGTTGTTT CTATGGATGA AATGAT 526



(2) INFORMATION FOR SEQ ID NO: 184:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:




ACCATATCAA TACTATCGAA AAGTACAAGG GATNCAGTCT CAAGGTCGCT bAGGAAbAI I 60

TGAATGACCT AGACGATGGT GAATTTTACT ATCACGAGAT TATCGGTTTG GAAGTCTATG 120

AGGGTGATAG CTTGGTTGGA ACCATCAAGG AAAATCCTGC AACCAGGTGC TAATGATGTC 180

TGGGTGGTCA AACGAAAAGG CAAACGTGAT TTGCTTTTAC CTTATATCCC ACCAGTGGTT 240

CTCAATGTTG ATATTCCAAA TAAACGGGTC GATGTGGAAA TCTTAGAAGG GTTAGACGAT 300

GAAGATTGAT ATTTTAACCC TCTTTCCAGA GATGTTTTCT CCACTGGAGC ACTCAATCGT 360

TGGAAAGGCT CGAGAAAAAG GGCTCTTTGG ATATCCCAGT TNTCATAATT TTTCGAANAA 420

AATGCTGAAA AGG 433



(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:



CTGCAACTTG AATTTCTCCT TCTTGTTTCA AAGCAAGATA AACAATTCGA GCCCCTCTTT 60

TTTCTAGCAA ATCCCCCATC TGGACAGATT GCATAAAGGA ACGAGAAGAA ACCTGATCAG 120

AATAAGTCTG AAACTCTTCT TTCGTGAGTG TTGTTAGTGC CATATACTTA CTTTCTATGT 180

TTTTTTCTTA ATGTTTTACG GAAATCAAGA GCAAGTCTTA ACAGAGGATA GAGAGGATGA 240

GTGGGCATTG TAAATTCACC CAAGTATTCT TCAATCGTTG GATTAAATTT TTCCTTAAAA 300

TGATAAAGTC CACCATTGAG AGAGTTTTCA ACACCACCTA AATTTTGCCA GACCATACCT 360

CGCTCAAAGG CATAGCGAGC CGTTTCATAC CATGTTAAAA TTGGTGCATT GTAACGTTTA 420

AAATCATCAT CCATACCAGC ATATATATTG ACAGAGGTAG TACCAAATTC CAAACTCAAA 480

GTACCGCTAA AGGAATCTCG CTTGACCTAC ATCTATATAT TCCTGCAAGA AGGTCAATTC 540



(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 


CTCAGGCTAA AAGAGTCCAC TGGACTCTTT TACTCCGTCC CATAACCAAT GACTTAATGG 60
CGCGTTCAGC TAGATTATTG GAAAGGACCA GATGTCCGTC TTTCAAAATA GTCTTAAAGG 120
TTTCTTCATA CTTGAGGCTG TATTCAATTG CCCTTCCTAG TTTTGAACCC GATAAAACTG 180
ACTGACGCCG GCACCAAGCA AAGAAATCTT CCATTAGGGG TTGGAGCTCT TCTTGACGTT 240
TCTGTAGTCG TTCATCAGCT GACAATGTCT CCCAGTCTCT TTCCAAGGCA AATAACTGAT 300
CACAATAGGC TAATCCTTTA G 321

(2) INFORMATION FOR SEQ ID NO: 187: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 415 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

CTTGACCAAA CAAGGCAGTT GGAAATTGAT TTTGGAGGAT GAGAGCGCTG GAAGGGAACT 60
CTTCCCAATC TTGACTCAAG GTCNATATAT CGCAACATTT GACCAACAAG CGCCANCTAT 120
CGATGAAATC TTTAAACTAT AATCAGGAGT GGAAGTATGA GAAATATGTG GGTTGTAATC 180
AAGGAAACCT ATCTTCNACA TGTCGAGTCA TGGAGTTTCT TCTTTATGGT GATTTCGCCG 240
TTCCTCTTTT TAGGAATCTC TGTAGGAATT GGGCATCTCC AAGGTTCTTC TATGACTAAA 300
AATAATAAAG TGGCANTACT GACAACAGTG CCATCTGTAN CATAAGGACT GAAGAATGTA 360
AATGGTGTTA ACTTCGACTA TAAAGATGAA GCAAGTGCCA CCAGAAGCAA TTAAA 415

(2) INFORMATION FOR SEQ ID NO: 188:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

CTATNAAATA GAGTGGTTCT CTATCTGCCA TGACCACAAA TNAGGACAAT GATACTTCTG 60
AACGTTCAGA CTGCTATCGT AAAAGGACAG CGGGTGAGAC GCCCATGAAT GATCTAACCA 120
GTCATACCCA CGGAGGAAAT TATACAATAG CACGTTATCA GGAGGAAAAA TTTTGGAACA 180
AACAGTTGTA AAATATAGTC TGCAGTTATC AATGCTTAAT TTTTTACAGG CGAAGATAAT 240

(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 



CTTGTCATCC TCACGAAAAT TGCGATTGTA AAGCACCAAG CCTTGACTCG TGATAGACTG 60

AATCATGCTT CTCTCATGTA CTCCTCAAGT CGTTTCATGG CTTCTTTGAT AGTCTCCATG 120

CTGGCTGCAT AAGATAGGCG GACGTAGCCT TCCCCGTAAC GTCCAAAGGC TGCACCAGGG 180

ATAAAGGCAA CGGCCTTCTT CTGAGCAAAA TCCTTCAGAA AAGCAAAGGA GTCTTGATTG 240

TAGCCCGCTG GAATTTTAGC AAAAATATAG AAGGCACCGT CTGGTTTGAT AATCTCAAAA 300

CCAAGAGCAG TCATTTTTTC GATGATATAG TCCCGACGTT GGATATATTC CTTCTTCATG 360

GGCTCCGCAT CGTTTTTACC AGCCGTCAAG GCTTCTACCG CAGCATGTTG CGCCATGGTA 420

TTTGCGGCAG TGACCAAGTT ACTGGTGACT CTTGATTAAC TGGGCTGTGA AGGTCNCANG 480

AGCAAAAATC AGCCCCAAAC GCCAACCTGT CATGGCATG 519



(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 468 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 



CTTCAGCTAT TCGTGGGATA TTAGATGTAC TTATTTAGGG GTTGAAATCA TATGAATATT 60

ACCAATTTGT TTTCTATCAA GACAGGATGT GATGAAACTG ATAGGCAACT GCAAAAACTA 120

TTTTTTCAGT TGGATTTACA ATTGGGAGAA TTGACAGATC AACTAAGAAA ATTAGATTCT 180

AATTTTGTTC CTCGTAGTCA ATTTGTAGAC ACGTTGGATT TGAATGATGT AGAATATAAA 240

GAAATTTTAA ACTATTTTAT CTTCCATCGT AATGATAGTG AAGAAAGTTT GGTAGAATGG 300

TTATATGATT GGATTTCCAC AAATCGTTAT GAACTTCCCT AAAGAGTTTT CCGATTCCGT 360

ATGGCTCCAT AAATACCCAT GAAAGTGTTA CTGAAGTTTT CCGGAGAATG AATAACTAAA 420

AAACAGTCAT TAGTGACTGT TTTTTATAGA AAAGAGTTTT TATATGTT 468




(2) INFORMATION FOR SEQ ID NO: 191: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

CTTGAATAGT CTCATCACGA AAATTGCCAT AACTGATAAA ACCAACCACC TTCACGCCAT 60 

CCATCGCAAT CAATGTATTT TCTGGATACT TTTGACTAAA GAATCGACAT CTTTCTAATG 120 

TCATTGTATC CTGAAATTCC GCAGGCAAAA GGTCATCATA AGACTCTCTC CACGTTTGCC 180 

AGTGAACGAG GGATTTGCCT TCTATCTCTT CAGGAGTTNC CATAGATTTG ATAATAACCG 240 

TCATTTATTT TCTCCCAGTC TNCTCTCAAA ATACCATATT TAATACTATC AAAATATTTA 300 

CCTTGATAAT AACGAACTTT NGGAATATGA GCTNCTTTTT TCATNCTTAA TTTTTCAGCA 360

AGTTTCATCA TACCAAGATT TCCTGACCAA GTTGTCAAAA CCAGATGCTC CAACTCCAAG 420 

TAATCCTGAA ACGTCCTATC TATCCACTGC AACATAGCAA CTTTCCCAAT ACCAGTGTTC 480 

CAGAATTTTT TATCATAAAT ACCAATTCCC AATCCATCCA TCTTGTTTCC TTTACATACC 540 

CAATA 545 

(2) INFORMATION FOR SEQ ID NO: 192: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 



CTTGATTGTG GAAGAAGTCC GTGATCGCTA TGTGGGCAAA GTCGATGCCG TTTTTCATAA 60

CGGCGATTCT GAACTACGTC CGGATTCTCC ACTTTGGGAG GGCATCCGCG TTGTTAAAGG 120

GAACATGGAC TTCTACGCCG GCTACCCAGA ACGTCTGGTG ACTGAGCTTG GTTCGACCAA 180

GATTATCCAA ACTCATGGTC ACTTGTTTGA CATCAATTTC AACTTTCAAA AGTTGGACTA 240

CTGGGCTCAG GAGGAAGAGG CCGCTATCTG CCTCTATGGT CACTTGCATG TGCCAAGTGC 300

TTGGTTGGAA GGCAAGATCC TCTTTCTAAA TCCAGGCTCT ATCAGTCAAC CACGAGGTAC 360

CATCAGAGAA TGTCTCTATG CTCGTGTGGA GATTGATGAT AGTTACTTCA AAGTGGACTT 420

TTTGACACGA GATCACGAAG TGTATCCAGG TTGTCCAAGG AGTTTAGCCG ATGATGCCAA 480

(2) INFORMATION FOR SEQ ID NO: 193:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



CGTAACCATT AACTATGAGG TGGTTTGCCT CCTCAGCGAC CGTATTCCGA GAGAATATTA 60

TTAGAAAAGA AAGGAGTGGA GCATGAATCT ACATCAACCC TTGCATGTCT TGCCTGGTGT 120

GGGACCAAAG TCAGCAGAAA AATACGCCAA ACTAGGAATT GAAAACTTGC AAGATCTCTT 180

GCTCTACTTT CCTTTCCGTT ATGAAGACTT CAAAACCAAG CAGGTGCTGG AACTAGAAGA 240

CGGTGAGAAG GCAGTTCTTT CTGGTCAGGT AGTGACTCCT GCTAGTGTCC AGTATTATGG 300

TTTCCAAGCC GCCAATCCGC CTGCGTTTTT AGTCCTCCAA GCCAGGGAAG AAGGTTCGTT 360

TTTTTTTGGC GGGTGAAATT TCCTTTTTAA CCCAGCCCCC TATCCTTGGG CTTGAATAAA 420

AAATTAGAAA TTTTGGGGAA CCAACCCCTT TGGCTTGT 458



(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:



CCTTACTAGT TTACTCAATC TTTACATCGT TCTTTCGGCA GTTCAAACTC AGAATCCACA 60

GCACAGGTTA TGCAATTACT GTCTGAAAAT ATGTTAAAAA CCATTCAGTC ACTTTCGGTC 120

TGGCAGATTT ATTTGCTTGG TTTTGAGCGA ATCTTGGCGC TTGGTTTCCA ATTACTTTTG 180

ACAGTTTGGG TTTACCAAGC TGTTCGCCAG AAGAAATGGA TTTATCTCCT AGCAGCCTAT 240

GGCTTGCATG CCTTCTTTGA TCTGGCACCA TCTCTTTTCC AAGTAGGCTG GTTGACAAAT 300

CCAGTCTTGG TTGAAGTGAT TCTAGCACTG GAGCTCGTTC TGGTCGCCTA TGGAACCAAG 360

GAAATCTTTT GTAAAAAATC ATAAAAAGGG GGGAACCTCT TTTTCTTATG CAAAATCCAA 420

ACAAGGTATT TTTATGGTCG TCAAATGTCT CTAAAAATGG TATAATGGAA TGAATTTTGT 480

AAAAGGAAGA ATGACATGTC TGTAAGAGAA AAAATGCTTG AAATCTTAGA AGGAATTGAT 540

(2) INFORMATION FOR SEQ ID NO: 195:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 523 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:



CTTTGACAAA GAGTATGACA CCTGACCGTG AAGTCATTAC CTTTATTCCT GAAAAATTTA 60

TTGTGGATGG TTTCCAAGGG ATTCGTGACC CACGTGGCAT GATGGGGGTT CGCCTTGAAA 120

TGCGTGGTTT GCTTTATACA GGACCTCGTA CTATCTTGCA CAATTTGCGT AAGACGGTTG 180

AGCGTGCAGG TGTTCAGGTT GAAAATGTTA TCATTTCACC ACTAGCAATG GTTCAGTCTG 240

TTTTGAACGA AGGGGAACGT GAATTTGGTG CTACAGTGAT TGATATGGGG GCAGGTCAAA 300

CGACTGTCGC TACAATCCGT AATCAAGAAC TCCAGTTCAC ACATATTCTC CAAGAAGTGG 360

AGATTATGTA ACTAAAGATA TCTCCAAGGT TTTGAAAACC TCTCGCAAAT TAGCGGAAGG 420

CTTGAAACTG AATTACGGGG AACCTATCCG CCTCTTGCAA GCAAAAAACT CCATTAAAGT 480

TATTGGAGAA TTAAACCAGT CAAAGTGACG GAACCTACTT GTC 523



(2) INFORMATION FOR SEQ ID NO: 196: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: 

CTGTGATTTC AGAGAAGAAA TCAAGTGCTG TAACAGAAGT AAGATGTAAT TGTATGTAAA 60 

GGAGACGTCA TGTTAAATAG TATTGTAACC ATTATTTGTA TTGCCCTTAT CGCGTTTATC 120 

TTGTTTTGGT TTTTCAAAAA GCCTGAAAAA TCTGGACAAA AAGCCCAGCA AAAAAACGGA 180 

TACCAAGAGA TTCGAGTGGA AGTCATGGGA GGCTATACTC CTGAGTTGAT TGTCCTCAAG 240 

AAATCAGTGC CAGCCCGCAT TGTCTTTGAC CGCAAGGATC CTTCACCATG TCTGGATCAA 300 

ATTGTTTTTC CAGATTTTGG TGTACATGCG AACCTGCCAA TGGGGGAAGA GTATGTAGTG 360

GAAATCACGC CTGAACAGGC TGGAGAGTTT GGCTTTGCTT GTGGTATGAA CATGATGCAC 420 

GGCAAGATGA TTGTAGAGTA GGTGGAGACT ATGACAGAAA TTGTGAAAGC AAGCTTAGAA 480

AATGGCATTC AAAAAATCCG TATCCGAGCT GAAAAAGGCT ATCATCCACC CATATCCA 538



(2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 



CCTCAAGAAG TCCGCGATAC TTACGAAAAT ATTTTGTAGA GAATCGAACC GCAAGGTTCG 60

GTTTTCTTTC TCTTTTTGTC TATAATTTGG TATAATAAAC AGTATGAAAA TCGTATCAGG 120

AATCTATGGG GGACGTCCCC TCAAGACACT AGAAGGCAAG ACAACAAGAC CTACTTCGGA 180

TAAGGTTAGG GGAGCCATTT TTAACATGAT TGGTCCCTAC TTTGAAGTGG GACNAGTCTT 240

GGACCTTTAT GCAGGTAGTG GTGGTTTATC TATCGAAGCA GTATCGCGTG GCATGTCCAG 300

TGCTGTTTTG GTGGAGCGAG ACCGTAAGGC TCAGACCATC GTGGCTGAAA ATATCCAGAT 360

GACCAAGGAA GTTGGAAAAT TTCAACTCCT CCAAAATGGA TGCAGAAAGG GCATTGGAAC 420

AGGTTNTCTG GGGAATTTGA CCTCGTTTTC TTAGACCCTC CCTATGCCAA GGAACAAATC 480

GTAGCAGATA TTGAAAAAAT GGCTGAGAGA GAGCT 515



(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 



CTGGANACAA ACTTTACAGA TCAAGTCGAT ACGATGATTT ATGTTGATAA AGAAGAAAAA 60

GAAACTATTA AAGCTGCACT TGTGGAGTTT TTTAATGGAA AAGTCACTTT AACTGACCAA 120

GGTTTACGAG AGGTTGAAGT TCCTGTAAAC TTAGTGTAAA CAATGAATAA TACAGCGTTT 180

CGTTGACATT CTCACAACTA CTTTAGCGAG CAAAATAAAA AGATGCGTAC CAAAATATAC 240

TAGAAAATGA AGCAATTCAA ACGAAACCTG ATATCGTTTT CCTTCACACC TATTTACTAG 300

AATTAACTGA ACGCAATCAC TTGAAAATTA ATGACTTTGG AT 342



(2) INFORMATION FOR SEQ ID NO: 199: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 418 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:



CTATTAGCGA CTTTCTCTGA AATATGGTAT GATAAAGGAT ATACAAGGAG ATAAAATGAA 60

TAATAATTTA CTGGTATTAC AATCCAGACT TTGGTCTGGT TGATGGTGCG GTATCGGCTA 120

TGATTGGAGT GGCTTTAGAA GAGTCTCCAA CCTTAAAAAT CCATCACTTG ACGCACGATA 180

TCACGCCTTA TAATATTTTT GAGGGGAGCT ATCGTCTCTT TCAGACGGTG GATTACTGGC 240

CTGAGGGAAC GACGTTTGTA TCGGTTGTCG ATCCAGGTGT CGGTTCGAAA CGTAAGAGTG 300

TAGTTGCCAA GACTGCTCAA AAATCAATAC ATTGTCACGC CAGATAATGG GACGCTTTCC 360

TTTATCAAGA AACACGTTGG CNTTGTANCC ATTCGTGAGA TTTCTGAGGT GGCCAACT 418



(2) INFORMATION FOR SEQ ID NO: 200: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:



CTCTCGCTTT TCTCATAGTG GGAGGTAAGG ATGGAATTAC GCAGACCAAG ATTAGCGGAT 60 

AAGAAAGCTG TTTTAGATAT GATGACAGAG TTTGAAAAGA GCCAATCAGC CCATGATGGA 120 

GGATTTTGGG ATACAGAGAA TTTTGTGTAT GAAGAGTGGT TGGAAAGCAA TCAGGAACAG 180 

GAAATGGGGA TTAATCTGCC TGAAGGATGG GTTCCTGCAA TTCAGTTAGT GGCTTTTTCT 240 

GAGAAAGGTC AAGCAGTTGG ATTTCTTAAT CTCCGGTTGC GCCTCAGTAA CTTTCTACTA 300 

GAAGAAGGTG GGCACATTGG CTACTCCATC CGTCCATCTG AAAGAGGCAA GGGTTATGCA 360

AAAGAACTCT CCGTCAGGGC TTGCAAGTTG CTAAGGAAAA GAACATCAAG AAAGCTCTGG 420 

TGACCTGTAG CGTGAATAAT CCTGCTAGCA GAGCATCATT CTA 463 

(2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 475 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 



CTCTTACGTG ACCAGAACAT GCTACTTCCC ATCCGAGAAG CCATTCTATC TGGCTTACCA 60

GTGTTTGGGA CCTGTGCGGG CTTAATTTTG CTGGCTAAGG AAATCACTTC TCACAAAGAG 120

AGTTATCTAG GAACTATGGA TATGGTGGTC GAGCGTAATG CTTATGGGCG CCAATTANGA 180

AGTTCTNCAC GGAAGCNNAA TGTAAGGGAG TTGGCNAGAT TCCNATGACC TTTATCCGTG 240

GTCCGATTAT CAGTACTGTT GGTGAGGGTG TANAAATTCT AGCAACAGTG AACNATCAAT 300

TTGTTGCNNC CCAAGANNAA AATATGTTGG TAAGTCCTTT TCATCCAGAA TTGACTGATN 360

ATNTGCGCTT GCNCCAGTAC TTTATCAGTA TGTGTAAAGA AAAAAGTNGA GATTGAATTT 420

CTCCAACTTT NCCACNTGTA ATNNNCAATN NCNATGTATT GGAGTACGGA CGCAG 475



(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 



CCTTCAAAAA TTATTTGATG CAACCTTAGA CAATCCAGAA CTCACCGCAA CATTAGTGCC 60

TTTAGGAGAT GGTATTCTCA TGCTTCGTAA AAATGTAGCA GATGTTCAAC TGTCTGAAAG 120

CGAATGATTT TCAGAAAAAT TTAAGAAAAA ATAGTAAAAT AGATAGAGTA ACACTTATCT 180

CAAAGGAGTA GACATGAAGA AAAAATTATT GGCAGGTGCC ATCACACTAT TATCAGTAGC 240

AACTTTAGCA GCTTGTTCGA AAGGGTCAGA AGGAGCAGAC CTTATCAGCA TGAAAGGGGA 300

TGTCATCACA GAACATCAAT TTTATGAGCA AGTGAAAAAC AACCCTTCAG CCCAGCNAGT 360

CTTGTTAAAT ATGACCATCC AAAAAGTTTT TGAAAAAACA ATATGGCTCA GAGCTTGATG 420

ATAAAGAGGT TGATGATACT ATTGCCGAAG A 451



(2) INFORMATION FOR SEQ ID NO: 203: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:



CTATGGATAA GGCACTTGCT GACCTCAAAA CATCAGGGCA CTTGCCTATT CCGCGACACC 60

TACGTGATGG GCACTACANT GGAAGCAAGG AACTGGGGAA TGCCCAAGAC TATCTCTATC 120

CACACAACTA TCCTGNAAAT TGGGTCAAGC AAGACTATCT GCCACAAAAA ATTCGTAATC 180

ATCACTATTT CCAAGCAGAA TATACTGGTA AATATGAACG GGCTTTGGCT CAAAGAAAGG 240

AAGCTATCGA CCATTTGCGA AAAATCTGAA ATCCTTTTCA AAAAATTGCA CTTTCCTCTT 300

GATTTTTTTT GAAAAAGTGG TATCATATAA ATATAGAAAC GCTGTGGTGT ACGACTTCAC 360

ACTTAAGTGT TGACCGACTA TTTTTTGTAT TATTANGGAA ACAAAAGTCT TCTGTCAGCA 420

TGTAGGCCGT CTCACACGGA AACAGCTTCA GTT 453



(2) INFORMATION FOR SEQ ID NO: 204: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 



CCGTAGTACA GGCACAAAAA GATTTGGAAA ATAGAAAAAG AAAAGCCAAG AAAAAGGCTC 60 

AGAAAACGAA ATAAATAAGG AGGAATCTGG TAATGGTAGT ATTTACAGGT TCAACTGTTG 120 

AAGAAGCAAT CCAGAAAGGA TTGAAAGAAT TAGATATTCC AAGAATGAAG GCTCATATCA 180 

AAGTCATTTC TAGGGAGAAA AAAGGCTTTC TTGGTCTATT TGGTAAAAAA CCAGCCCAAG 240 

TGGATATTGA AGCGATTAGT GAAACGACTG TTGTCAAAGC AAATCAACAG GTAGTAAAAG 300

GCGTTCCGAA AAAAATCAAT GATTTGAACG AGCCTGTGAA GACGGTTAGT GTAAGAAACC 360 

GTTGACCTTG GTCATGTGGT TGATGCTATT AAAAAAATAG AGGAAGAAGG TCAAGGTATT 420

TCTGATGAAG TC 432

(2) INFORMATION FOR SEQ ID NO: 205:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:



AAAATCAAGA AATTCTTTGA AGGAACGACA GATACAGCTG AGAACTATAT CAAGTCGGCC 60

CTTAAAATGT TGGTCAAATA GGAGCAGAGA ATGACAAAAC GTTGTTCGTG GGTCAAGATG 120

ACCAACCCGC TCTACATCGC CTATCATGAT GAGGAGTGGG GTCAGCCCCT CCATGATGAC 180

CAAGCATTGT TTGAGTTGTT GTGTATGGAA ACCTATCAGG CAGGCCTGTC TTGGGAAACG 240

GTACTCAACA AACGCCAAGC TTTCCGAGAA GCCTTTCCAT AGCTATCAAA TTCACTCCAG 300

TCGCAGAGAT GACTGACACT GAATTGGAAG CCATGCCTGG AGAATCCCAG CTATCATTCC 360

GAAATAGAGC CCAAGATTTT TGCTACACGC CGCCTAACGC CCCAACCCTT TCCTACAGTT 420

ACAGGCAG 428



(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 



CTATTAGATG GTATTGACTG CCCAGACACT TACGGATCTA GCTGAGACAG GGAATTGTCC 60

ATAACCTTCC TCATCAATTG TAACTTGACC TTGGTGGTTA CCAAGTAAAT CTACAAAGGT 120

TTGATTAGTC CATTCTTGAC CGACAAACAT TGACTTGCTG TTTTCTTGGT CATTTGAAAT 180

CAAGACTGCG ATTGGGGATT GATTTTCAGC ACCTGAACGT ACCCAACCGA TACAGTTAGC 240

ATGGTCAAAG TAGTCATTTT GTTCTCCATA GGCCAAATCT TTTCGGATGG CTAGGAGGCG 300

GTCAAGGATT TCTTTGAAAT CTTGTTGAGC ATACTGGCCT GAAATCCCAT AGTAGTCTCC 360

GTAAAAGACA CATGGAAGGC CGTCTTGGCG TAACAAAATG AGGGCATAGG CTGCTGGCTT 420

GAACCATTCT TCAACGGTAG ACTCAAGGGC TTGTCCTCGT TGGGTATCGT GGTTGTCGAC 480

AAAATCACAG ACTTGTCAGG CTTGAGTTCA ACCAGGCT 518



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:

CTCTGTCCTT TTACCTGAGA GTTTGAGCAG TTGCCTGCCT TCCCCCTTCG GTGCCTTTAC 60
GGTCTCTCCA GAGTTCCGTC CATTTACAGT CATGGAAAAT CAAACGATTC CCCACTTCTA 120
TTAAACTTCA TTCGGTGTTG GTATTTAATT GATTCTAATT TTACAAAAAA TGTTGGCTTT 180
TGTCAATGTG TTTATTAGTA AAAATTAGTT CAACAGTTTT TACTTTATAA AGTCCAGAAT 240
ACTGCTATCC TTTAAAAGTG ACAATAGTCG CACCACTGCC TCCAGCATTT TGTGGGGCAT 300
AGCCGAAACT CTTGACATGT TTGTTTCTTT GCAAGTATTT GGTAACTCCT TCACGGATGA 360
CTCCTGTTCC GATACCATGG ATGATATCAA CTTGAGCCAT ATTGTTAAGC AAGGCTTGGT 420
CGAATGAAGG TATCTAGCTC ATTCATGGCT TCTTCATAGC GCTTGCCTCG AAGATTCAGT 480
CTAGCTTGAG TCCTCGCCCA GAAGTTCG 508

(2) INFORMATION FOR SEQ ID NO: 208:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:



CTCTTGGGCT CTCTTCTCAG CAACTTCCCT GCTTTCACCT TTGGTCTTGA TAGGTGANAA 60

CGTAATATNT CCAATCNNNG TCATATTAGG AAANAGTTAA ATTGTTGGAA AACCATGCCC 120

ATCTTCTCAC GCATGGGAAA CAGGTCATTC TTCTTGTCCG TAATATCGAC TCCCTCAAAG 180

ATAACCTTCC CCTTGGTTGC TTCCTCCAAC AAATTCATAT AGCGAAGCAA GGTAGATTTC 240

CCACTCCCTG AAGGACCGAT GATAACGACA ACTTCTCCTC TTTTAATCTC GAGGTTGATG 300

CCCTTCAATA CTTCATTCTT TCCAAAAGAT TTATGTAATT TTCAATTTTT ATCAAGGTTT 360

CTGTCATTAT TTCTTATC 378



(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:

AGATAAACCT CGATACGGCT TTCTGCTCGT TTGATGCGTT GCAAGAGTCC 60

ACGTCAACTG TATCCAAACC AAAGACCTTG TTTTATTTCA GCCAGTGGTG 120

A PT A A A C2 AGG GCATGGAAGT CTTCAATTTG GCTTCTAAGT TCTGGTAATT CTTGTCTGGC 180

P A TTTYZTTGT AAACTTTCTT TATCATCCGC TTGGTAGGCA TGACGAATGC GTCGTCCCAC 240

CTACTTAAAA TAGCATTCAA CTGGGCCTGA GTTTCAAAGA GATAAGCATA 300

P.TTTPCAGCT TTTTCTTTAA A A A A AAA * TGTCAGCAAT AGTTCCCGCC GCCTGAGCGA AGTGTGGCTT 360

out 1\? lUnivj x GTCGGTCAAG TATCGGACAG AGAACATCCT GATAAAAGAC 420


ATAGCGGTTG GGATTGATGC CACTGAGATT ACCTGGTAGG TCTGGTAAGA GGTTGGCAAG 480

ATCAATCTGC ATAAAATCCT CAACCGTTAG ACCAGTATTG GTCTTGAAAT GCGCAGACAA 540

ACTATCTAGG TCATTGCGAT AGATCAAGTT ATCGATACCG TCGACCTCGA GGGGGGCCGT 600

ACCCAGATTT GCAAGCTTGG TAGGATAAAG AACTGGGCAG TTTCACCACC ATTGTCTCCC 660

CAACCCGTTA CGATGACTTC TTTAATCTGA TTGGCACGGC AGGCTTTATT AGCCTCGATA 720

GCCACTAGAC GGCTAAAATG G 741



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 



AAACTCGATT TGAGAGAAAG TCCAATAAGT CTTTCCATAA TAAAACGCAT AGTAGCAAGA 60

GTTTCTACAC CCACTATTAT GCGTTTTTAT CATTTTGAAA TCTACTGCTA ATCAATGAAG 120

AAGCAAAGAT CAAACTAGGC AGCTAGTTAC AAGTTACGTA AGCGCGTCAT AACAAGGTAT 180

CTATCCTAAT TCCCCAACTT CCAGTATACT AGATAAAAAA TAATTATCGG AGGAAAGGTA 240

TTGTACTATG ATAATTCAAC TAAGTGATTT AGGTCAAGTT CACCTTGTTT GTGGCAAGAC 300

AGATATGAGG CAGGGAATAG ACTCATTAGC CTATGTAGTT AAAACCCACT TTGAATTGGA 360

TCCTTTCTCC GGTCAAGCCT TTCTCTTTTG TGGTGGACGT AAAGACCGCT TTAAAGCCCT 420

TTACTGGGAT GGTCAAGGAT TTTGGCTACT ATATAAACGC TTTGAGAACG GAAAACTGAC 480

TTGACCAAGT ACAGAAAAGG ATGTCAAAGC TCTCACACCA GAACAAGTAG ACTGGCTTAT 540

GAAGGGCTTT TCTATCACTC CCCAAATATA GTGGATTTAA ACTAGAATAG TACACCCCTA 600

CTTCTAAAAC ATTGTTAGAA ATCGATTTGA CTGTCCTGAT CAATTTGTCC TGTTCTTATT 660

TCATTTTACT ATAGAATCCA TCTGAAAGCC GTTTATGATT TCTATTGAAA TGAAGACCGT 720

CCATTTTAGT AGACTAAAGG ATTACTCAAA ACTTCGAGAA GGAAGACAGA TGAACTTGTT 780

TCTTATAAAA ATGTTGTCTG AAACTATTCG CTTTTTTCCA GAAATTTTAG AGGAAGAAAA 840

CTTTATGAGG AAAAAGGAGT TACTCAAATA ACAATTCAGG ATTAAAAATA GACAGTTGAG 900

GAGCGGAAGG TATAAATTAA GTTTGCTACT GTATAATGGA TTTAAATCAC TCAGCAGACA 960

GAACGAATAC TCTTCGAAAA TCTCTTCAAA CCACGTCAGC TTCGCCTTGC CGTAGATCCA 1020

TAGGGGACTG GACTTCATCA GTTCTATCAA CAACCTCAAA ACAGTGTTTC GAGCTGACTT 1080

CGATCAATTT TATCTGCACC TCAAAGCTGT ACTTTGAGCA AGCCTGACGA CTAGCTTCCT 1140

ATTTGATTTT CATTGAATAT CAGAAACTCA TTCTCCATCA AATAATTCGA CTGCGTCTAA 1200

TAATTTTTGG TCTGGCACGG TGTCAGAAAT AAAGGTTGTG TATTTGGAGA GGGGATTAAT 1260

TTTAAAAAAT CCAGTCTTGT AAAATTTAGA ACTATCAATC AGTAAGATGG TTTCATGGGC 1320

TTTGTCAATA ATATTCTTTT TTGAAATAG 1349

(2) INFORMATION FOR SEQ ID NO: 211:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1359 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:211:

CTTTTTTATA AGCAATTCAA TTTTATACTC ATCTGCTTTC AAAAAGCATT CTAGTCCATC 60

TCCGATTAAC GATGGACTTT ATCACCTCCT TCTCCAATCC TTGTATGACA TCTTGAAGTT 120 

GATTCATGAC ATCTTCCAAA ATTCGAAAGG CTTTATTCTT AAATCCACGT TTACGAATCT 180 

CTTTCCACAC TTGTTCAATG GGGTTCATCT CTGGTGTGTA TGGAGGAATA AATGCAAAAC 240 

CAATATTAGT CGGAATCTTT AAGACACTTG ATTTATGCCA TATAGCATTG TCCATAACGA 300 

GTAAAAAATA ATCATCTGGA TAAGCTTGTG AAAGCTCTTC TAAAAAGGCG TTCATCCACT 360 

CAGTATTACA TCTACCAGCT ATTAAGAAAA AATGATTCCG CCTGTTATGG GCATCAACAG 420 

CTCCATAACA ATAGTGAAAT TCTCGTATAT AGTGACTATG GACATGTGGA CCTACTCCTA 480 

TTGGAGACCA ACAAGATCCC AGTTTACTGA TTCTACCGAA ACCAGCCTCA TCTTGGTACA 540 

TCAAGCGAAC CTTATGAAAA CGTCTACTGG TTTAAATCGC TTTCTTGTCT TCTTGAATTG 600 

AGATTTTATT TTTAGACGCG ACAATGGTTT GAGCGTCTGC TTTCTTAGGA TGTTCTGGAC 660 

GTGGCATAAT ATTTCGCCAG CCATGGCGCT TCAACAGTTG ATAGAAGGCA TCACGTGTGT 720 

AGGAACGACC TAACTCCTTT TTATAAGCCT GAAATAAGGC ATCAATTGTA ACAAATTCTC 780 

CTGCCTCTGT AGCCTTCAAA TGGCGGGCAA GAAATACTTT CTCTTCCTCA ACTGTCATAT 840 

ATGCATGGTT ACACCACCAC TTTCCTGATA GAGTTGTTCA CATCTTATTT CAAACTACTA 900 

TAAAAGTTCT ATAATCTCTT TATAAGATTT GCCCATCAGA CGAAATATAA TAAATTGAAA 960 

CTAGAATAGT ACACCTCTAC TTCTAAAACA TTGTTAGAAA TCGATTTGAC TGTCCTGATC 1020 

GATTTGCCCT ATTCTTGTTT CGTTTTACTA TAGAACGATT TGAAGGCGTT TATGATATTG 1080 

AGCTGTACGA GAGTCTTTTA AAAGTGTTTT GATGGCTTGG ATTTCTTCTT TAGTTGATTT 1140 

CATATCACTA TTATATAATG CTTTTTGATC TTTAGTTTAG TATTATCGTA GAAAATGGAC 1200 

TAGGTGCTAT GGATACACCG GATGAGAATG GTTATGTAGC AGATGACTAT CGGATTACTT 1260

ACTTAGAGGC CCACATCAAG GCCATGCGAG ATGCCATTTA CCNAGACGGG GTTGACTTGC 1320

TTGGTTATAC GACTTGGGGC TGTATCGATC CAGTTTCAG 1359



(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1082 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

CTAGTATTTC CGTTGTGNTT AGTTTGTCTG CCTATGGAGT TATTGCCTAT GTGCAAGGTC 60
AGNTGGATAT TCTTCTAGTG ATTCTGGCCA TGATTGGTGG TTTGCTCAGT TTCTTCATCT 120
TTAACCATAA GCCTGCTAAG ATCTTTATGG GTGATGTGGG AAGTTTGGCT TTAGGTGGAA 180
TGCTGGCAGC TATCTCTATG GCTCTCCACC AAGAATGGAC TCTCTTGATT ATCGGAATTG 240
TGTATGTTTT TGAAACAACT TCTGTTATGA TGCAAGTCAG TTATTTCAAA CTGACAGGTG 300
GTAAACGTAT TTTCCGTATG ACGCCTGTAC ATCACCATTT TGAGCTTGGG GGATTGTCTG 360
GTAAAGGAAA TCCTTGGAGC GAGTGGAAGG TTGACTTCTT CTTTTGGGGA GTTGGGCTTC 420
TAGCAAGTCT CCTGACCCTA GCAATTTTAT ATTTGATGTA AGAATGGCAC CCTGATGTTT 480
TAGGGTGTTT TTGTGTTTAA ATACACAATG AAAATCAAAG AACAAACTAG AAAGCTAACT 540
TTAGGCTGCT CAAAACATAG TATATTGAAA CTAAAATAGT ACACCTCTAC TTCTAAAACA 600
TTGTTAGAAA TCGATTTGAC TGTCCTGATT ACGATTTATC CTGTTCTTAT TTCATTTTAC 660
TATACCATCA ATAAGTGTTG AAAAGATCGT TGACAGAACT GACCAAGCCA GATCATCTTT 720
GTAACCACCT TGAGCTTGTA GCATATTTGT TAATTGAGCA AAGGCGCTAG AAGAGTTTGC 780
TGGGTCAGCA GTTGCGATTT TTCCTTTTAG TTCAGGTTTG AGAAGGTCGC TATATCCTTC 840
GATGTTCATG CCTTTAGTTA AATCAGGGTG GACGATTAAA ACACTACCAT CTAGTGTATA 900
AGGAGTAGAG AAGCCAGTTG TGTTTTGATA TTCTTTGATA ACATTATCAT TTTCTTTTGA 960
AATATAGTTT TCAAAGAGTT CTCCGTGGGT AGCATATTGT GTTATAAGAA CCACCAAAGA 1020
TAACATCAGC TACAGGAACT TCTTTTTCTG ACCTAGTTTT TTGAAAAGTT CTCCAGTACC 1080
AG 1082



(2) INFORMATION FOR SEQ ID NO:213:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1303 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213:

CTGGACTTGA TAGGCATCTT TGTAATCCTC TAAAGCCTCT TTCATCAAGG CACTACCAAT 60
TCCTTGACGC TGATAGATTG GTAAAACGAT TAAATCCTGA ACCAATACTG ATGAGAATCC 120
ATCTCCAACC AAACGAATCA AGCCCACCAC AGCATCACCA TCAAGTGCCA CATAAATTAC 180
TAATGAATGA GATAAGGCCT GCTCCAGCAT CTCTGGTTGA TGGGTATAAT TAGTCCAACC 240
GACAGCCTGA TAGACATGTA AAACATCCTC TAGCTTGACA ATTTCTTGCT TTTTAATAGT 300
AATCATCTCA ACACCTCTTA AAGTTCTCTC AAGCTCTTGT ACTGCTGTCC ATTTTTATCA 360
AAATTTTCAG GACGCAACCA TGTTTCCAAA CGATCTTTGA CTTTGGGCCA GTCCTTATCA 420
ATCATAGACA ACCAATCCAT ATCTCTCGTA CGCCCCTTAT AAACCACTGC CTGACGGAAG 480
GTTCCTTCAT AAATAAAGCC CAAACGCTCC GCAGCACGTC TGGATGGCAG GTTAAGAGCA 540
TCGCATTTCC ACTCATAACG ACGATAGTTA AGCTCTTCAA AGACATAGCG AGCCAGTAGA 600
TACTGGGCTT CTGTCCCTAT CCGTGTCCCC CTGAGTTTTG GAGAAAAAGT GACAGCTCCC 660
ACTTCTATTA CTCCGTTACT GGTCAATACG CATGAGAGAA AAATTCCCAA AGCCTTACCA 720
GTTGCCTTGT CTATGTAGTA AAAACGGTCC TTACGAGCCA ACATCTGACT ACTATAGTAd 780
ATTGAAACTA GAATAATATA CCTCTACTTC TAAAATATTG TTAGAAATCG ATTTGACTTP 840
CCTGATCGAT TCGTCCTATT CTTATTTCAT TTTAATATAA TTGATAGTGG TCGCCCCAGC 900
CAGATACCTT ATCTGCTATC CATTTAGGAA CCCCTAACTT AAGCAATCCC CATAATCGTC 960
TCGATTTCTT CTTCCATTGC TTCCAGATAA TCACTCGTAG GCGAGTACGC AAGCGCTCAT 1020
CTATGTTAGT GACTATACTT TTCATATTTA TAATTCATTC CTTTCGTTTC ACTCAAGGCA 1080
CAACACAGAA TGAAAAAGTG TTGTGATCTT TATTTTGTTT TATAATAATA GTGAGAAAGC 1140
CTATCACTAC TACAAATCAC GGGGAGGTGA ATAAGTGAGT GGTACAGCCA CTACCTCGCA 1200
TATTTTGTCA CATCATTTAA CGGTACATAA TAAGTTGTAC CATCTGAATA AGTTGCTACA 1260
ATATCATTTG CATGCTCTCC TTCACCTTTA GCAAAGGTTG GAG 1303



(2) INFORMATION FOR SEQ ID NO:214:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1567 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:

CTCTTCAATC AACTCACGCT TTAGGGCTTC CTGATGATTT TCACCTGCTT CAATTTCTCC 60
ACATGGTAGG AACCAAGCAC CATTTGGTTC TTGAACAAGA ACAATTTGTT TTTGTTCAGG 120
ATTAGGGATA ACTGCATATA CGCCATAGCG AGCAATATAG TCTGTATTCG CTTTTTTCTC 180
CGAAAGTTGG GTTTGCCATT GCATTTTCCT CATTATCTAG TATCGTTATT ATTATAGTGA 240
AATGAACCAA AAATAGTACA CAATGTGGTA TAATCTTTTT ATGGCATATT CAATAGATTT 300
TCGTAAAAAA GTTCTCTCTT ATTGTGAGCG AACAGGTAGT ATAACAGAAG CATCACACGT 360
TTTCCAAATC TCACGTAATA CCATTTATGG CTGGTTAAAG CTAAAAGAGA AAACAGGAGA 420
GCTAAACCAC CAAGTAAAAG GAACAAAACC AAGAAAAGTT GATAGAGATA GACTTAAAAA 480
CTATCTTACT GACAATCCAG ACGCTTATTT GACTGAAATA GCTTCTGAAT TTGGCTGTCA 540
TCCAACTACC ATCCACTATG CACTCAAAGC TATGGGCTAC ACTCGAAAAA AAGAACCACA 600
PPT A PT A Tf2 A


ACAAGACCPA 


GAAAAAGTAG 


CCTTATTTCT 

± x- x x x ^ x 


TAAGAATTTT 


ANTAGTTTAA 


660 




ACAGGTCTAA 


CAAATGGTGA 


ATTAATCGCT 


CCAATGACTT 


ACGAAGAGAT 


720 


GATGACGAGC GACTTTTTTG AAGTATGGTT TCAGAAGTTT CTCTTACCAA CATTAACCAC 780
APCATCGGTT ATTATAGTGA AATGAAATAA GAACAGGACN AATCGATCAG GACAGTAAAA 840
TPGAATTCTA ACAATGTTTT AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT 900
TAATGGACAA TGCAAGATTC CATAGAATGG GTAAGCTAGA GTTCTTATGT GAAGAGTTTG 960




TTTACCTTTT 


CCTCCCTACT 


CATCTTAGTA 


TAGAAAAGTG 


AATCTAAAAT 


1020 


a^tapataap 


TGC TTCTAAA 


ACATTCTTAT 


AAATTGATTT 


AAATTCTCAA 


ATCATATTAT 


1080 




TTTP A TTTTfl 

1 X AV-rtX X X Xw 


CTPTACAATC 


CTGTTGAGAA 


GACACGTGTT 


CATATCAAAA 


1140 


aggtattggp 


AAGTTGCAAT 


ACCTTTTTAC 


AAGGTTCTTT 


TGTCTTATTT 


TTGTTTCAAC 


1200 


TCZ A PT AT ATT* 


TC CTATGGTT 


CTAGTTCAGA 


AGGCTAGGCT 


ATAATTATGA 


TTGATAAGAA 


1260 


GTATCAT i 




A PTf A ATY2TT 


TPA AAATPAT 


OVJU 1 1 1L 1 x\ X 


A A TP. P. TT A f*2P. 




CTGGCATTTG CTAGACCGCC ATCTTTACGA AGAAGTGGTT CTTTATAGCC TAGGAGAGTA 1380
CGAAGACTGG CAGTAAGATT GGCGCCGTGT CCGACAATTA GAATACGCTC AGCTGGACTA 1440
TCTTTTAATG ATTTGATAAA TTGGATGGTC CGCTGAGTTG TACTATAGAG GGATTCGGCT 1500
CCGAACATTC GAGTGTCAAA TTGAGCAAGA TTTGAACGAA AAGCCTGGAT TTGTTGCGGG 1560
TAAATAG 1567



(2) INFORMATION FOR SEQ ID NO: 215:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 486 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:

CTGTTTCCAT AGCATGACTT CTGTACTAGA CTTTCTTTTC CGAATAAATA GATAGAACCA 60
CAGAATCTAG TAAACCTAGA ATTAAAATTA TGGTATAATA TTAGCAATAA AAGAAATCTG 120
GAGGATTAGA ATCATGGTAT CAACGAAAAC ACAAATTGCT GGTTTTGAGT TTGACAATTG 180
CTTGATGAAT GCAGCAGGTG TGGCTTGTAT GACGATAGAG GAGTTAGAAG AGGTCAAAAA 240
CTCAGCGGCA GGAACCTTTG TTACTAAGAC AGCGACCTTG GACTTCCGTC AGGGGAATCC 300
TGAGCCACGC TACCAAGATG TTCCACTTGG TTCCATCAAC TCTATGGGCT TGCCAAATAA 360
TGGCTTAGAC TATTATTTGG ATTATCTTTT AGATTTGCAG GAAAAAGAGT CGAACCGAAC 420
TTTCTTCTTA TCTCTGGTCG GCATGTCTCC AGAGAAACCC ATACTATTTT GAAAAAAGTC 480
CAGAGA 486



(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 814 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:





AAACJTTAAAT 


1 nn 1 1 inl Av7 


nnnlnl I 1 i A 


(jIAI PTAAGC 


CCTACTGTTA 


60 


TAGATTCAAT ACACTATATA TGTGTTTGTC TGATAAAAAT TTCTACTCTT TTTGATTTTA 120
AATAAGTATT AGTTTACATT ATGGTGTGAA TTGGGTTTGA TATCTCTTTT GAGGAAGTTG 180
CCTTAGATTT TTCTGATTGT GTTTTATTGT ACAGTGTATC TTGCTTGTTT TGAACAGAAT 240
TTTTTATGAC ATTTGTCATA TTTTCTAGTG ACAGAAGCTT CTGCCTCCTC TGATTTTAAA 300
AGACTATAAT TGTAGTATGA AATGGGGGAA GAAGAGATGA GAAATAAAAT GATTATCGCA 360
GTAAGTTTAG TAGTAGCAGG AGTTATGACC TATCTCATGT TTTCGGGATT GGATGAGAAT 420
TTCTACCATT TTCCTTGGGA GGTCTTTGCT GGCTTTGGAA TCATCTTCTT GGCTTGTCCA 480
GAGAAGGTTT GAAATTAGTA AGAGATGTGA AAAAGGAGTT TGAAAAATGA AAAAAGCAAC 540
TATCTATTTC TTTATCGGCC TGTCACTCTT GGTATGGTTG GTAGAAATGT TTACAGGTTG 600
GTTTGCTCAA ACCTTCCTTC ACCAGTTCAT CCGTGGTGCA TGGGGATTAG GATTTATGAT 660
TTTTATCGCC TTTCCGATGG GAAAGGAGTT GCTGGAAGGA GAATATCATG AACATGATTA 720
AGGTTCAAGG ACTACATAAA AATATTAAGG GCAAGGCTAT TTCAAAGGAT ATTTCCTTTG 780
AAGTAGCAGA AGGTGAATGC GTTGCCTTGA TTGG 814

(2) INFORMATION FOR SEQ ID NO: 217:



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1319 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

CTTGGTACGA CCGTATGGGT GGTCACTGAA AGTGGGAAAC TTCCAAGATG GGCACTGTGT 60
ACGGATCCCC GTAAACTGTC GCANAAGAAC TGAAGATGAT GTTTTTACAG TTGTTTTCTT 120
CCATGGCTTT CAAAAGGCTG ACAGTTCCAG CGATATTGTT GTCATAGTAG GCAAGAGGGA 180
TACGTGTTGA TTCGCCAACA GCCTTCAAAC CAGCAAAGTG AATGACACCA GTCAGTTCTT 240
CCTGCTTGAA AATATCTCTG AGGGTATCTG TGTCACNAAT ATCTGCCTCA TAGAAAGGAA 300
TCTCAACTCC TGTGATTCTT TCAACAACTT CTAAACTCTT ACNATTGCTA TTGACAAGAT 360
TATCCACCAC AACAACTTGA TGGCCTGCTT GGATCAATTC AATAACAGTG TGGGTTCCAA 420
TAAAACCGGC ACCACCAGTT ACCAAAATCT TTTCTTGCAT CTTTTTTCCT CGATTCTCAG 480
ATTATTTTTT CTTATTTTAA CCATTTTTGA CAGGGAATGT CATTTGCCAC CTTAAACTAC 540
CTGATAAAAT TTCAGTAAAA TGCTTATACT CTTGGAAAAT CCAATTCAAA CCACGTCAAC 600
GTCGCCTTGC CATGGGTATG GTTACTGACT TCGTCAGTTC TATCCACAAC CTCAAAGCAG 660
CGCTTTGAGT AACCCGCGGC TAGTTTCNTA GTTTGTTCTT TGATTTTCAT TGAGTATTAT 720
TCGCTTTTTA CTCGTTTGAC ATAGTTTTCA ATTGGGTAGT TTTCAGGGTC CAAGGTCAAC 780
TCCTTGTCTT GGATCAGTTG AGCTAAATGG TAACCAATAA TAGGACCAGT TGTGAGGCCT 840
GATGAACCTA GTCCACTGGC TGCATAGACA CCAGTTAAGT CAGGCACCTG CCCAAAGAAA 900
GGAGAGAAAT CACTGGTGTA GGCACGGATT CCAACACGCT CAGATTTTGA AGTAGCTTCA 960
GCCAAAATCA GATAGTGAGT CAAGGTGGCC TCCTCCATTT GTTGGAGCAA GGTTTCATCT 1020
ACCGTCAAAT CAAATCCCAT GTCATTTTCG TGGGTAGCGC CTAAGGACAA TTTCCCACCT 1080
GCAAAAGGAA TCAAATCCCA CTCCCCTTCT GGCATGACAA CAGGGTAAGC TTCCATGTCT 1140
TGGACCAGCT GATAATCTCG TAGTTGTCCT TTTTGAGGAC GAACATCTAC TTCATAACCC 1200
AAGGGTTCTA ACATGTCCCC CAACCAAGCT CCNGTCGCCA AAATAACCTG CTCAAACTCC 1260
TCTTCACCAA TCTGGTAGCC TGATGCTAAC GGTGTCAGAG TCACTTTTTC TTTGACCAG 1319



(2) INFORMATION FOR SEQ ID NO:218:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 534 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

CTCAAAATAC TGTTTTGAGG TTGCAGATGG AAGCTGACGC GGTTTAAAGA GATTTTCGAA 60
GAGTATAAAC TGCTTATAAA ATAAAAAGGA GCCCTGATGG AACACATTAT TTATCAGTTT 120
GAAGAGGACT TGGCAATCCT TACCTTGAAC CGTCCTGAGG TCGCAAATGG TTTTCATATT 180
CCCATGTGTG AGGAGATTTT AGAAGCTCTG ACTTTGGCAG AAGAAAATCC AGCTGTGCAT 240
TTTATCTTAA TCAATGCCAA TGGAAAGGTC TTCTCAGTTG GGGGAGATTT GGTAGAGATG 300
AAGCGGGCAG TGGATGAGGA TGATATTCCA TCATTGACAA AAATCGCAGA GTTGGTCAAT 360
ACGATTTCTT ATAAAATCAA GCAAATAGCC AAACCTGTTT TTAATGGAAG TTGACGGTGC 420
TGTTGCAGGT GCCGCAGCGA ATATGGCTGT TGCGGCAGAT TTCTGTTTGG CGACGGATAA 480
GGCTAAGTTT ATCCCAAGCC CTTTGTTGGT GTGGTTGGCT CCAGATTCCA GGGG 534



(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 784 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

CTGCAGCCGG AAGTTATCTG GTTACAAAAA ATCGCCAAAC CATCACAGAT GAAAGTCTTG 60

AATACCACTG ACCGCGTTCA AGCTATCAAG GACGATGTGG ATATTATCCA AAACAGCCTG 120 

CAAATCATTA ACCAGCAAAA AGAACTTATC AAGGAATACC AAGAAGACTT GACTTACAAG 180 

TTTAAGGTCT TGGAAAAGGA TATCCAAACT AGGACTAGCT GTGATAAAAG AAATGCAGGG 240 

AACTGAAGAT AAGTAAAAAG AGCCCGACGG CTCTTTTTTA AAACGTAAAC AAGTTATAGT 300 

GAATTGAATT TAGAATAGCA CATCATAGTT TCTAAAGCAT TGTTAGAGAT TACTTTAAAA 360 

TCCTCTTATC AATTTGTTCA TATTCTATTT CAATCTACTA TATATATCCC ATCAACTATG 420 

CATCATAATT TAGGTAACTC ATACTCAATA AAAATCAAAA AGCAAACTAG AAAGCTAGGC 480 

ACAGACTGCT CAAAACACCG TTTTAAGGTT GTGGATAGAA CTGACGAAGT CAGTAACCAT 540 

ACCCATGACA AGCCGACGCT GACGTGGTTT GAAGAAATTT TTGAAGAGTA TCATTCACCA 600 

TTTCACAACT AAATCGAAGA CTTTCTGCCG TAATGAAAAC ACCTGAAACA GCTTGGTTTC 660 

AGCTGTCCGG AAACTTTAAG ACTTAGGTTC AAAGTTTAGG TATGGAACTT CGAAGGAGGT 720 

CGCTACCGTC CGTCATTACT TAGNGAAGNC TTAAAAAATC TATAAAGTAA AAAGAGCCCG 780 

ACAG 784



(2) INFORMATION FOR SEQ ID NO: 220: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



CTACTCGTTT TAGTATAGTA AACTGAATCT AGACTAGCAC AATGCNGCTT CTANAGCGTT 60 

TCGAAACTAG AATAGGACAA CATAACTGCT AAAAGATTTC TATAAATTCA TTTGATTTTC 120 

CTAATCAATT TGTTCGTATC CTANTTCACT CCACTATAAA ATATTCTTAT CAATTGATTT 180 

GNATGCCAAA ATTCCATCGT TCAGGTCTTA TTTCATTNGG CTNTAAAAAA CTCTCCTGGG 240 

NAATCCCAGG AGAGNATGAT TACNTATTTG AACTTGAATC CTTCGTANAT AAGCTCTGTT 300 

TTTGGATTTT GTTTCTTAAT CTGTTTGGCA AGTGCCTTCA TCATAGAAAT AGGACCACAC 360 

ATATAGACGG TTGCATGTTC GGGCACTTTT TTTGTTCAAA ATTAAGATAG CCGTCTTTCG 420 

TACTGTCGAT TAGATGGAGT TCAAAATTAG GATTTTTCTG AGCATAGTTA CGGAGTAAAT 480 

CTAGGTAGAC TGCATTTTCA TCTCCACGGA AGCTATAGTA GAAGTGAACC TGTTTATCTA 540 

AAATAGGATG TTCACGGATG TAAGAGATGA AGGGGGTGAT CCCAATACCT CCAGCAATCC 600 

AAACCTGATT TTCTCGTCCT TCTTCTATGA TCATGTGTCC GTAAGCTCTG TCTAGGGTTA 660 

CTTTGCTGCC GGCTTGAAGA TTATCATAGA TATTTCTTGG TATGGTCGCC TGAATTTTTA 720 

ACAGTAAAGT TAAAGAGTTT GAACCCATGA ACTCCCTGAG AATAGAAAAG GGATGCCGGA 780

ACCACTTTCA AAGCCTTCTT GGAAAATCTT TAGAAAGGCA AATTGTCCTG ATTGATAGTT 840

GAAAGGTCTG CTAAGATGGA TTTGAATTTC TCTAGTATCG TGATTTAAGC GTTTGAGATG 900

GGTAATTTTC CCTAGATAGG GGAAGGAAAT CTTTTGATAT AGAAAAATGA TATAAAAACC 960

AG 962



(2) INFORMATION FOR SEQ ID NO:221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 



CTCTTTCCAC TATTATAACA CGAAAGAAAG AATTGTCAGG GAACTGTACA GATTTCTTTT 60
CTATCTATTT ATAACTCAAT GAAAATCAAA GAGCAAACTA GGAAACTAGC CGCAGGCTGT 120
AATTGAGTAC GGCAAGGCGA CGTTGACGCG ATTTGAATTT GATTTTCGCA GAGTATTATT 180
CGTAAAAAAT CTCAAAAAGC CTACCTTTCG GTAGACTTAG TTTGTTTCTA TTCTAATCGG 240
CACTCTTCCA AAATTTTGCT CTGCTATACT TGGCTTTCCT AGTTGGTAAA TCTGGTCAAC 300
CTTTTGAGTC ATAGCATCCC AAGGTTCTTT GCCAATTCGG CTGACTAGAT TGACCTGTCC 360
TTTCAGAGAC TTGAGATGTT GCCTGCCTTT TTCAGTAAAT CCAAGGACAT GAATGGCTTC 420
TGGCAAGTCA CTTTCTCTAG CCTGCATCAA AATATAGGTC AATAGGCGTC TGACACGCGC 480
CTTGGTGTAA CGTTTGGTAG TCACTAACTC GACCAATTCT TCCACAGACT GGGCTGTTTT 540
AATAGCTTCT TTAATGCGCA CAGCCATTTC TTGATTGACC TGATAGATGG TGGTTAGGTC 600
GGGATTTGAC AAGATTTGAT AGCGGAGCAA GGGAAAATAG TCTTCCCAGA TCACCTTACT 660
GGCTTGCTCA AATAGGGCAA CAGAAGGCAT AAAACGTTCT AAGAAATCTT GGTCCTTCTG 720
ATGTTGACGG AGGGCTGTCG CCGAGGCAAA GTCCACATCT TTATTCACAG AATGGTAACC 780
TGCCCCCTGA CGCTGAATCG GATGCAGCTT GATATTCCGT CCTGCAACCG CCTTGGCATA 840
GGCCAAAACA AAGAACATGA TTGGGGTGTA TTACCTGAAA AATCAAGACC AGCAAATTCC 900
TTCCACATAG 910



(2) INFORMATION FOR SEQ ID NO: 222: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 866 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:



CGCCAAGTCA TCCAACAAGT CCAGAACGGT TCTGGGGCTT GGGATCCTCA GTCGCTGGAG 60 

ATAACTCCTT TGGGCTTGTT CATCATGTAG TAGACAAACT CTTCATACTC CAACACTTGC 120 

CCATCAAAAG CGAATCTCAT CTATTTTTTC ATCAATCTGC AATTTAGCTG ATTTTTCTTT 180 

TTTACCATTT ACAGTCACGC GCCCAGCCTT GAGCAAGTTT TTGACCTCAG TCCGACTTCC 240 

CACCGCACAG GCAACTAAAA ATTTATCTAA TCTCATAGAA CTATTATATC ATATCAAAAG 300 

GAGGCTGGTA CAATGACCAA CCTCCTTTTC GTTTCATACT CTTCAAAAAT CTCTTCAAAC 360 

CGCGTCAACG TCGCCTTGCC GTATATATGT TACTGACTTC GTCAGTTCTA TCTGCAACCT 420 

CAAAGCAGTG CTTTGAGCAA CCTGCGGCTA GTTTCCTAGT TTGCTCTTTG ATTTTCATTG 480 

AGTATCAGAT TTAGGAAATT AACTTCCTCG TCTCCAAAAA ATAGCTAAGA CAATCATGGC 540 

ACCTAAAACA GCTGGGATAA TAGCTGTTCC TGATAAAACT GGCCCCCAAG TTCCAAAGAG 600 

CAAGTGACCT AGAAAGGCTC CGATCCAACC GAGAAACATT TTTCCAAAAC ATCCCATTCG 660 

CTCTCCACGA TTGGTCATAG CACCTGCTAA AAATCCCACT AGGAGACCAA CGAACATACT 720 

TCCTAACATA TTATCTCCTT AATTTGCCCA ATTCCCATTA CGGAAAAGAA GTACTCGCGT 780 

TCCATCCTCA CGAATACCAT CGATATCCAT TTGGTTAGAA CCAATCATAA AGTATACGTG 840 

AACATCTGAA CGGTTAAGCC CTGCAG 866



(2) INFORMATION FOR SEQ ID NO: 223: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 



CTTCAGGATG TGGGAAAACG ACCCTTCTTC GTATGATTGC AGGTTTCAAC AGTATCAAAG 60 

ATGGAGAATT TTACTTCGAT GATACAAAAA TCAATAATAT GGAACCCAGC AAACGCAATA 120 

TCGGGCTGGT TTTCCAAAAC TACGCTATTT TCCCACATTT GACTGTCCGA GACAACGTTG 180 

CTTTTGGTCT TATGCAAAAG AAGGTTCCAA AAGAAGAATT GATTCAACAG ACCAACAAGT 240 

ATCTTGGACT CATGCAAATT GCTCAATATG CGGATCGAAA GCCCGATAAA CTCAGTGGTG 300 

GACAACAACA ACGTGTCACC TTGGCATGCG TCTTAGCGGT TAATCCAAGT GTTCTCCTCC 360 

ATGGACGAAG CCACTTAGTA ATCTGGAGGC CAAACTTCGC TTGGATATGC GTTCAAGCCC 420 

ATCCCGAGAA ATCCAACCAC GAAGTTGGGG AATTACAACT GTTTATGTAA CCCACGACCA 480 

AGAAGGAGCC ATGGCTATTT CAGACCAAAT TGCCTGTTAT GAAAGATGGG GTGATCCAAC 540 

AAATCGGCCG ACCAAAAGAA CTGTATCATA AACCAGCTAA TGAGTTAGTG GCAACCTTTA 600 

TCGGACGCAC AAATATTATC CCTGCCAATC TTGAAAAACG GAGCGACGGC GNTTATATCG 660

TCTNTTCAGA TGGANANGCC CTTCGAATGA TAG 693



(2) INFORMATION FOR SEQ ID NO: 224:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 183 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

Met Ser Met Phe Thr Met Val Lys Met Asn Pro Leu Arg Gly Leu Ile
1 5 10 15

Cys Asn Leu Lys Gly Asn Lys Ile Thr Ala Leu Ile Gly Pro Ser Gly
20 25 30

Ser Gly Lys Ser Thr Tyr Leu Arg Ser Leu Asn Arg Met Asn Asp Thr
35 40 45

Ile Asp Ile Ala Lys Val Thr Gly Gln Ile Leu Tyr Arg Gly Ile Asp
50 55 60

Val Asn Arg Pro Glu Ile Asn Val Tyr Glu Met Arg Lys His Ile Gly
65 70 75 80

Met Val Phe Gln Arg Pro Asn Pro Phe Ala Lys Ser Ile Tyr Arg Asn
85 90 95

Ile Thr Phe Ala His Glu Arg Ala Gly Val Lys Asp Lys Gln Val Leu
100 105 110

Asp Glu Ile Val Glu Thr Ser Leu Ser Gln Ala Ala Leu Trp Asp Gln
115 120 125

Val Lys Asp Asp Leu His Lys Ser Ala Leu Thr Leu Ser Gly Gly Gln
130 135 140

Gln Gln Arg Leu Cys Ile Ala Arg Ala Ile Ser Val Lys Pro Asp Ile
145 150 155 160

Leu Leu Met Asp Glu Pro Ser Leu Ser Leu Gly Ser Asp Cys Asp His
165 170 175

Ala Thr Arg Arg Asp His Val
180



(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 247 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:

Met Ile Lys Ile Ser Asn Leu Ser Lys Ser Phe Ser Gly Gln Thr Val
1 5 10 15

Leu Asp His Leu Asn Leu Asp Ile Gln Lys Gly Glu Val Val Ala Leu
20 25 30

Ile Gly Ser Ser Gly Ala Gly Lys Ser Thr Phe Leu Arg Ser Leu Asn
35 40 45

Tyr Leu Glu Thr Pro Asp Ser Gly Ser Ile Gln Ile Asp Gly Phe Ser
50 55 60

Val Asp Phe Ser Lys Ile Thr Gln Glu Glu Ile Leu Ala Leu Arg Arg
65 70 75 80

Lys Leu Ser Met Val Phe Gln Gln Phe Asn Leu Phe Glu Arg Arg Thr
85 90 95

Ala Leu Asp Asn Val Lys Glu Gly Leu Val Val Val Lys Lys Leu Ser
100 105 110

Asp Gln Glu Ala Thr Lys Ile Ala Lys Glu Glu Leu Ala Lys Val Gly
115 120 125

Leu Ser Asp Arg Glu Asn His Tyr Pro Arg His Leu Ser Gly Gly Gln
130 135 140

Lys Gln Arg Val Ala Leu Ala Arg Ala Leu Ala Met Lys Pro Asp Val
145 150 155 160

Leu Leu Leu Asp Glu Pro Thr Ser Ala Leu Asp Pro Glu Leu Val Gly
165 170 175

Glu Val Glu Lys Ser Ile Ala Asp Ala Ala Lys Ser Gly Gln Thr Met
180 185 190

Ile Leu Val Ser His Asp Met Pro Phe Val Ala Gln Val Ala Asp Lys
195 200 205

Ile Leu Phe Leu Asp Lys Gly Lys Ile Ile Glu Ser Gly Thr Pro Asp
210 215 220

Glu Ile Ile His Thr Pro Lys Glu Glu Arg Thr Lys Glu Phe Phe Thr
225 230 235 240

Ser Tyr Lys Arg Thr Tyr Ile
245

(2) INFORMATION FOR SEQ ID NO: 226:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 243 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

Met Phe Ser Leu Arg Ser Val Phe Asp Gly Ile Pro Arg Ile Val Gln
1 5 10 15

Gln Leu Pro Thr Thr Ile Met Leu Thr Ile Gly Gly Ala Leu Phe Gly
20 25 30

Leu Val Leu Ala Leu Leu Phe Ala Ile Val Lys Ile Asn Arg Val Lys
35 40 45

Ile Leu Tyr Pro Leu Gln Ala Phe Phe Val Ser Phe Leu Lys Gly Thr
50 55 60

Pro Ile Leu Val Gln Leu Met Leu Thr Tyr Tyr Gly Ile Pro Leu Ala
65 70 75 80

Leu Lys Ala Leu Asn Gln Gln Trp Gly Thr Gly Leu Asn Ile Asn Ala
85 90 95

Ile Pro Ala Ala Ala Phe Ala Ile Val Ala Phe Ala Phe Asn Glu Ala
100 105 110

Ala Tyr Ala Ser Glu Thr Ile Arg Ala Ala Ile Leu Ser Val Asn Pro
115 120 125

Gly Glu Ile Glu Ala Ala Arg Ser Leu Gly Met Thr Arg Ala Gln Val
130 135 140

Tyr Arg Arg Val Ile Ile Pro Asn Ala Ala Val Val Ala Thr Pro Thr
145 150 155 160

Leu Ile Asn Ser Leu Ile Gly Leu Thr Lys Gly Thr Ser Leu Ala Phe
165 170 175

Ser Ala Gly Val Val Glu Val Phe Ala Gln Ala Gln Ile Leu Gly Gly
180 185 190

Ala Asp Tyr Arg Tyr Phe Glu Arg Phe Ile Ser Val Ala Leu Val Tyr
195 200 205

Trp Val Val Asn Ile Gly Ile Glu Ser Leu Gly Arg Phe Ile Glu Arg
210 215 220

Lys Met Ala Ile Ser Ala Pro Asp Thr Val Gln Thr Asp Val Lys Gly
225 230 235 240

Asp Leu Arg



(2) INFORMATION FOR SEQ ID NO: 227:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 63 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

Met Tyr Arg Ile Asp Asp Asp Glu Gln Phe Val Leu Asp Phe Leu Lys
1 5 10 15

Gln Glu Lys Val Leu Leu Val His Gly Arg Gly Phe Asn Trp Gln Glu
20 25 30

Pro Asp His Phe Arg Ile Val Tyr Leu Pro Arg Val Asp Glu Leu Ala
35 40 45

Gln Ile Gln Glu Lys Met Thr Arg Phe Leu Lys Gln Tyr Arg Arg
50 55 60

(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:

Met Asn Asp Glu Ala Ser Lys Gln Leu Thr Asp Ala Arg Phe Lys Arg
1 5 10 15

Leu Val Gly Val Gln Arg Thr Thr Phe Glu Glu Ile Leu Ala Val Leu
20 25 30

Lys Thr Ala Tyr Gln Leu Lys His Ala Lys Gly Gly Arg Lys Pro Lys
35 40 45

Leu Ser Leu Glu Asp Leu Leu Met Ala Thr Leu Gln Tyr Val Arg Glu
50 55 60

Tyr Arg Thr Tyr Glu Gln Ile Ala Ala Asp Phe Gly Ile His Glu Ser
65 70 75 80

Asn Leu Ile Arg Arg Ser Gln Trp Val Glu Val Thr Leu Val Gln Ser
85 90 95

Gly Val Thr Ile Ser Arg Thr Pro Leu Ser Ser Glu Asp Thr Val Met
100 105 110

Ile Asp Ala Thr Glu Val Gln Ile Asn Arg Pro Lys Lys Arg Ile Ser
115 120 125




Glu Ser Phe Trp
130

(2) INFORMATION FOR SEQ ID NO: 229:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

Met Arg Arg Lys Tyr Lys Ser Ile Ala Leu Lys Lys Glu Leu Ala Asn
1 5 10 15

His Ser Gly Lys Lys Lys Phe His Ala Met Lys Ala Gln Ala Ile Val
20 25 30

Thr Ser Gln Gly Arg Ile Val Ser Leu Asp Ile Ala Val Asn Tyr Ser
35 40 45

His Asp Met Lys Leu Phe Lys Met Ser Cys Arg Asn Ile Gly Gln Ala
50 55 60

Gly Lys Ile Leu Ala Asp Ser Gly Tyr Gln Gly Pro Met Lys Ile Tyr
65 70 75 80

Pro Gln Ala Gln Thr Pro Arg Lys Ser Ser Lys Leu Lys Pro Leu Ile
85 90 95

Ala Glu Asp Lys Ala Tyr Asn His Ala Leu Ser Lys Glu Arg Ser Lys
100 105 110

Val Glu Asn Ile Phe Ala Lys Val Lys Thr Phe Lys Met Phe Ser Thr
115 120 125

Thr Tyr Arg Asn His Arg Lys Arg Phe Gly Leu Arg Met Asn Leu Ile
130 135 140

Ala Gly Ile Ile Asn Tyr Glu Leu Gly Phe
145 150

(2) INFORMATION FOR SEQ ID NO: 230: 



(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

Met Val Ser Ser Ser Gly Ser Glu Phe Gln Ser Gly Trp Gln Glu His
1 5 10 15

Gln Leu Ile Ala Glu Lys Val Ser Lys Thr Leu Asp Lys Thr Phe Asp
20 25 30

Lys Asp Val Arg Lys Ile Pro Thr Ser Pro Val Leu Ser Lys Ile Cys
35 40 45

Arg



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:231: 

Met Ile Lys Ile Leu Ala Ala Cys Gly Ala Gly Val Asn Ser Ser His
1 5 10 15

Gln Ile Lys Ser Ala Leu Glu Glu Glu Leu Ser Asn Arg Gly Tyr Asp
20 25 30

Val His Cys Asp Ala Val Met Val Lys Asp Val Asn Glu Asp Leu Met
35 40 45

Lys Gly Tyr Asp Ile Phe Thr Pro Ile Ala Ala Thr Asp Leu Gly Phe
50 55 60

Glu Pro Gly Ile Pro Val Ile Glu Ala Gly Pro Ile Leu Phe Arg Ile
65 70 75 80

Pro Ala Met Ser Ala Pro Val Phe Asp Asn Ile Arg Leu Pro Ala Lys
85 90 95

Gln Asn Met Val
100

(2) INFORMATION FOR SEQ ID NO:232: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 49 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:

Met Asn Thr Lys Met Met Ser Gln Phe Ser Val Met Asp Asn Glu Met
1 5 10 15

Leu Ala Cys Val Glu Gly Gly Asp Ile Asp Trp Gly Arg Glu Ile Ser
20 25 30

Cys Ala Ala Gly Val Ala Tyr Gly Ala Ile Asp Gly Cys Ala Thr Thr
35 40 45

Val



(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233:

Met Arg Glu Lys Glu Ile Phe Asp Ser Ile Val Thr Ile Ile Gln Glu
1 5 10 15

Arg Gln Gly Glu Asp Phe Val Val Thr Glu Ser Leu Ser Leu Lys Asp
20 25 30

Asp Leu Asp Ala Asp Ser Val Asp Leu Met Glu Phe Ile Leu Thr Leu
35 40 45

Glu Asp Glu Phe Ser Ile Glu Ile Ser Asp Glu Glu Ile Asp Gln Leu
50 55 60

Gln Ser Val Gly Asp Val Val Lys Ile Ile Gln Gly Lys
65 70 75

(2) INFORMATION FOR SEQ ID NO:234: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 0 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 4: 

Met Ala His Gly Asp Leu Leu Tyr His Asp Gly Leu Phe Phe Ser Ala
1 5 10 15

Lys Lys Glu Asp Gly Thr Tyr Asp Phe His Glu Asn Phe Glu Tyr Val
20 25 30

Thr Pro Trp Leu Lys Gln Val Asp
35 40

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

Met Ser Leu Ile Thr His Arg Arg Phe Ile Ser Cys Asn Glu Asn Ile
1 5 10 15

Lys His Tyr Lys Arg Leu Ile Asp Lys Ala Glu Lys Cys Val Asn Asp
20 25 30

Leu Met Ala Glu Leu Asn Ser Val Ile Thr Thr Val Thr Gly Ile Glu
35 40 45

Asn Arg Leu Gly Ala Val Ile Leu Ala Glu Ile Arg Asn Ile His Ala
50 55 60

Phe Asp Asn Pro Ala Gln Leu Gln Ala Phe Ala Gly Leu Asp Ser Ser
65 70 75 80

Ile Tyr Gln Ser Gly Gln Ile Asp Leu Val Gly Arg Met Val Lys Arg
85 90 95

Gly Ser Leu His Leu Arg
100

(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

Met Ile Arg Ala Val Phe Phe Ser Ile Asp Asn Asp Met Glu Glu Ala
1 5 10 15

Ala Arg Ser Met Gly Ala Ser Ser Phe Tyr Thr Met Val Arg Val Ile
20 25 30

Ile Pro Tyr Ile Leu Pro Val Val Leu Ser Val Val Val Leu Asn Phe
35 40 45

Asn Ser Leu Leu Ser Asp Tyr Asp Leu Ser Val Phe Leu Tyr His Pro
50 55 60

Leu Phe Gln Pro Leu Gly Ile Val Ile Lys Gln Ser Thr Asp Glu Thr
65 70 75 80

Ala Thr Leu Asn Ala Gln Ala Met Met Phe Val Tyr Ser Val Ile Leu
85 90 95

Met Ile Met Ser Ser Ala Ala Leu Tyr Leu Ser Ser Leu Phe Gln Gly
100 105 110

Lys Arg Gly Lys Arg
115

(2) INFORMATION FOR SEQ ID NO: 237:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

Met Glu Lys Cys Tyr Thr Asp Val Thr Glu Phe Ala Ile Pro Ala Ser
1 5 10 15

Thr Gln Lys Leu Tyr Leu Ser Pro Val Leu Asp Gly Phe Asn Ser Glu
20 25 30

Ile Ile Ala Tyr Asn Leu Ser Thr Ser Pro Asn Leu Glu Gln Val Gln
35 40 45

Thr Met Leu Glu Gln Ala Phe Thr Glu Lys His Tyr Glu Asn Thr Ile
50 55 60

Leu His Ser Asp Gln Gly Trp Gln Tyr Gln His Asp Ser Tyr His Arg
65 70 75 80

Phe Leu Arg Val Arg Glu Phe Lys His Leu Cys His Ala Arg Glu Thr
85 90 95

Ala Lys Thr Thr Val
100

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

Met Glu Ile Leu Thr Thr Gln Gln Val Arg Glu Gly Tyr Gln Ala Ile
1 5 10 15

Leu Ala Ser Pro Leu Gly Leu Gln Asp Ala Phe Glu Val Ala Gln Glu
20 25 30

Lys Ser Gly Ser Tyr Thr Val Pro Pro Glu Ile Asp Gly Trp Lys Gly
35 40 45

Asn Thr Glu Pro Leu Arg Ile Asp Tyr Val Phe Thr Thr Lys Glu Leu
50 55 60

Ala Val Glu Asn Leu His Val Val Phe Asp Gly Asn Lys Ser Pro Gln
65 70 75 80

Val Ser Asp His Tyr Gly Leu Asn Ala Met Leu Asn Trp Lys
85 90

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

Met Leu Lys His Leu Asn Leu Lys Gly His Leu Leu Thr Ala Ile Ser
1 5 10 15

Tyr Met Ile Pro Ile Val Cys Gly Ala Gly Phe Leu Val Ala Ile Gly
20 25 30

Leu Ala Met Gly Gly Gly Val Pro Asp Ala Leu Val Ala Gly Lys Phe
35 40 45

Thr Ile Trp Asp Ala Leu Ala Thr Met Gly Gly Lys Ala Leu Gly Leu
50 55 60

Leu Pro Val Val Ile Ala Thr Gly Leu Ser Tyr Ser Ile Ala Gly Lys
65 70 75 80

Pro Gly Ile Ala Pro Gly Phe Val Val Gly Leu Ile Ala Asn Ser Val
85 90 95

Gly Ser Gly Phe Ile Gly Gly Ile Leu Gly Gly Tyr Ile Val Gly Phe
100 105 110

Leu Val Gln Ala Ile Ile Lys Lys Val Lys Val Pro Asn Trp Ile Lys
115 120 125

Gly Leu Met Pro Thr Leu Ile Ile Pro Phe Val Pro Leu Trp
130 135 140

(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

Met Ile Ala Leu Gly Trp Ser Asn Ile Gly Ala Ala Ile Ala Pro Asp
1 5 10 15

Ala Ala Leu Ala Ser Val Ala Ala Ala Ile Ile Met Val Leu Gly Gly
20 25 30

Asp Phe Thr Lys Thr Gly Ile Gly Val Ala Gln Ala Val Ala Ile Pro
35 40 45

Leu Ala Val Ala Gly Leu Phe Leu Thr Met Ile Val Arg Thr Ile Ser
50 55 60

Val Gly Leu Val His Thr Ala Asp Ala Ala Ala Lys Lys Gly Asp Phe
65 70 75 80

Gly Ala Val Glu Arg Ala His Phe Ile Ala Leu Leu Phe Gln Gly Leu
85 90 95

Arg Ile Ala Leu Pro Ala Ala Leu Leu Leu Met Val Pro Thr Glu Thr
100 105 110

Val Gln Ser Ile Leu Ser Ala Met Pro Asp Trp Leu Lys Asp Gly Met
115 120 125

Ala Ile Gly Gly Gly Met Val Val Ala Val Gly Tyr Ala Met Val Ile
130 135 140

Asn Met Met Ala Thr Arg Glu Val Trp Pro Phe Phe Ala Leu Gly Phe
145 150 155 160

Val Leu Ala Ala Val Ser Asp Ile Thr Leu Ile Gly Phe Gly Ala Ile
165 170 175

Gly Val Ala Ile Ala Leu Ile Tyr Leu His Leu Ser Lys Thr Gly Gly
180 185 190

Asn Gly Gly Gly Gly Ala Ala Thr Ser Asn Asp Pro Ile Gly Asp Ile
195 200 205

Leu Glu Asp Tyr
210

(2) INFORMATION FOR SEQ ID NO: 241:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:241: 

Met Leu Asp Val Glu Ala Ile Arg Lys Asp Phe Pro Ile Leu Asp Gln
1 5 10 15

Ile Val Asn Asp Glu Pro Leu Val Tyr Leu Asp Asn Ala Ala Thr Thr
20 25 30

Gln Lys Pro Leu Val Val Leu Lys Ala Ile Asn Ser Tyr Tyr Glu Gln
35 40 45

Asp Asn Ala Asn Val His Arg Gly Val His Thr Leu Ala Glu Arg Ala
50 55 60

Thr Ala Ser Leu
65

(2) INFORMATION FOR SEQ ID NO: 242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 



Met Thr Lys Leu Leu Asn Lys Lys Trp Lys Val Lys Ile Met Lys Gln
1 5 10 15

Ile Leu Leu Val Cys Asn Ala Gly Met Ser Thr Ser Met Leu Val Lys
20 25 30

Lys Met Gln Gln Ser Ala Thr Glu Arg Gly Ile Glu Ile Ser Ile Gln
35 40 45

Ala Lys Ser Met Thr Glu Ala Lys Lys Asn Ile His Glu Ala Asp Val
50 55 60

Ile Leu Ile Gly Pro Gln Ile Arg Tyr Glu Leu Leu Ala Val Lys Glu
65 70 75 80

Ile Ala Gly Asn Ile Pro Val Asp Thr Ile Asp Met Arg Asp Tyr Gly
85 90 95

Met Met Asn Gly Ala Lys Val Leu Glu Gln Ala Leu Glu Trp Ile Gly
100 105 110

Glu Ile Arg
115



(2) INFORMATION FOR SEQ ID NO: 243: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

Met Val Leu Met Gln Lys Met His Gly Lys Glu Leu Ile Thr Leu Asn
1 5 10 15

Gln Thr Val Lys Trp Tyr Lys Val Ser Gly Phe Met Ile Leu Leu Thr
20 25 30

Lys Pro Trp Tyr Tyr Leu Lys Ser Asp Gly Ser Tyr Ala Arg Asn Ala
35 40 45

Trp Gln Gly Asn Tyr Tyr Leu Lys Ser Asp Gly Lys Met Ala Val Asn
50 55 60

Glu Trp Val Tyr Asp Ala Thr Tyr Gln Ala Trp Tyr Tyr Leu Thr Ser
65 70 75 80

Asp Gly Ser Tyr Ala Tyr Ser Thr Trp Gln Gly Asn Tyr Tyr Pro Lys
85 90 95

Ile Gly Trp



(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:244: 

Met Leu Thr Val His Gly Lys Glu Ile Thr Ile Leu Lys Ser Asp Gly
1 5 10 15

Lys Met Ala Val Asn Glu Trp Val Asp Gly Gly Arg Tyr Tyr Val Gly
20 25 30

Ala Asp Gly Val Trp Lys Glu Gly Gln Ala Ser Thr Ala Ser Pro Ser
35 40 45

Asn Asp Ser Asn Ser Glu Tyr Ser Cys Cys Phe Arg Lys Gly Lys Lys
50 55 60

Leu
65



(2) INFORMATION FOR SEQ ID NO: 245:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 



Met Asn Asn Asn Phe Asn Asn Phe Asn Asn Met Asp Asp Leu Phe Asn
1 5 10 15

Gln Leu Met Gly Gly Met Arg Gly Tyr Ser Ser Glu Asn Arg Arg Tyr
20 25 30

Leu Ile Asn Gly Arg Glu Val Thr Pro Glu Glu Phe Ala His Tyr Arg
35 40 45

Thr Thr Gly Gln Leu Pro Gly Asn Ala Glu Thr Asp Val Gln Met Pro
50 55 60

Gln Gln Ala Ser Gly Met Lys Gln Asp Gly Val Leu Ala Lys Leu Gly
65 70 75 80

Arg Asn Leu Thr Ala Glu Ala Arg Glu Gly Lys Leu Asp Pro Val Ile
85 90 95

Gly Arg Asn Lys Glu Ile Gln Glu Thr Ser Glu Ile Leu Ser Arg Arg
100 105 110

Thr Lys Asn Asn Pro Val Leu Val Gly Asp Ala Gly Val Gly Lys Thr
115 120 125

Ala Val Val Glu Gly Leu Ala Gln Ala Ile Val Asn Gly Asp Val Pro
130 135 140

Ala Ala Ile Lys Asn Lys Glu Ile Ile Ser Ile Asp Ile Ser Gly Leu
145 150 155 160

Glu Ala Gly Thr Gln Tyr Arg Gly Ser Phe Glu Glu Asn Val Gln Asn
165 170 175

Leu Val Asn Glu Val Lys Glu Ala Gly Asn Ile Ile Leu Phe Phe Asp
180 185 190

Glu Ile His Gln Ile Leu Gly Ala Gly Ser Thr Cys Gly Asp Ser Gly
195 200 205

Ser Lys Gly Leu Ala Asp Ile Leu Ser Gln Ser Ile Ser Leu Val Glu
210 215 220

Asn
225



(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

Met Thr Ile Phe Pro Asp Asp Phe Leu Trp Gly Gly Ala Val Ala Ala
1 5 10 15

Asn Gln Val Glu Gly Ala Tyr Asn Glu Asp Gly Lys Gly Leu Ser Val
20 25 30

Gln Asp Val Leu Pro Lys Gly Gly Leu Gly Glu Ala Thr Glu Asn Pro
35 40 45

Thr Glu Asp Asn Leu Lys Leu Ile Gly Ile Asp Phe Tyr His Lys Tyr
50 55 60

Lys Glu Asp Ile Ser Leu Phe Ser Glu Met Gly Phe Asn Val Phe Arg
65 70 75 80

Thr Ser Ile Ala Trp Ser Arg Ile Phe Pro Lys Gly Asp Glu Glu Glu
85 90 95

Pro Asn Glu Ala Gly Leu Lys Tyr Tyr Asp Glu Leu Phe Asp Glu Leu
100 105 110

His Ala His Gly Ile Glu Pro Leu Val Thr Leu Ser His Tyr Glu Thr
115 120 125

Pro Leu Tyr Leu Ala Arg Lys Tyr His Gly Trp Ile Asp Arg Lys Asn
130 135 140

Asp Ser Phe Leu
145

(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247:

Met Ile Glu Tyr Lys Asn Val Ala Leu Arg Tyr Thr Glu Lys Asp Val
1 5 10 15

Leu Arg Asp Val Asn Leu Gln Ile Glu Asp Gly Glu Phe Met Val Leu
20 25 30

Val Gly Pro Ser Gly Ser Gly Lys Thr Thr Met Leu Lys Met Ile Asn
35 40 45

Arg Leu Leu Glu Pro Thr Asp Gly Asn Ile Tyr Met Asp Gly Lys Arg
50 55 60

Ile Lys Asp Tyr Asp Glu Arg Glu Leu Arg Leu Ser Thr Gly Tyr Val
65 70 75 80

Leu Gln Ala Ile Ala Leu Phe Pro Asn Leu Thr Val Ala Glu Asn Ile
85 90 95

Ala Leu Ile Pro Glu Met Lys Gly Trp Ser Lys Glu Glu Ile Thr Lys
100 105 110

Lys Thr Glu Glu Leu Leu Ala Lys Val Gly Leu Pro Val Ala Glu Tyr
115 120 125

Gly His Arg Leu Pro Ser Glu Leu Ser Gly Gly Glu Gln Gln Arg Val
130 135 140

Gly Ile Val Arg Ala Met Ile Gly Gln Pro Lys Ile Phe Leu Met Asp
145 150 155 160

Glu Pro Phe Ser Ala Leu Asp Ala Ile Ser Arg Lys Gln Leu Gln Val
165 170 175

Leu Thr Lys Glu Leu His Lys Glu Phe Gly Met Thr Thr Ile Phe Val
180 185 190

Thr His Asp Thr Asp Glu Ala Leu Lys Leu Ala Asp Arg Ile Ala Val
195 200 205

Leu Gln Asp Gly Glu Ile Arg Gln Val Ala Asn Pro Glu Thr Ile Leu
210 215 220

Lys Val Pro Ala Thr Asp Phe Val Ala Asp Leu Phe Gly Gly Ser Val
225 230 235 240

His Asp



(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

Met Pro Phe Val Pro Gly Ile Ala Leu Thr Asn Ala Val Arg Asp Ile
1 5 10 15

Met Thr Asn His Ile Asn Ser Gly Met Ser Lys Met Phe Glu Ser Leu
20 25 30

Leu Ile Thr Leu Ala Leu Gly Ala Gly Thr Ser Val Ala Leu Val Leu
35 40 45

Met Asn
50

(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:



Met Tyr Thr Ile Arg Met Leu Lys Met Gly Ser Glu Ala Ala Ala Lys
1 5 10 15

Ser Ala Gln Glu His Gly Leu Lys Ser Val Glu Val Thr Val Lys Gly
20 25 30

Pro Gly Ser Gly Arg Glu Ser Ala Ile Ser Cys Ala Cys Cys Arg Trp
35 40 45

Ser



(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

Met Ile Glu Phe Glu Lys Pro Asn Ile Thr Lys Ile Asp Glu Asn Lys
1 5 10 15

Asp Tyr Gly Lys Leu Val Ile Glu Pro Leu Glu Arg Gly Tyr Gly Thr
20 25 30

Ala Leu Gly Asn Ser Leu Arg Arg Val Leu Leu Ala Ser Leu Pro Gly
35 40 45

Ala Ala Val Thr Ser Ile Asn Ile Asp Gly Val Leu His Glu Phe Asp
50 55 60

Thr Val Pro Gly Val Arg Glu Asp Val Met Gln Ile Ile Leu Asn Ile
65 70 75 80

Lys Gly Ile Ala Val Lys Ser Tyr Val Glu Asp Glu Lys Ile Ile Glu
85 90 95

Leu Asp Val Glu Gly Pro Ala Glu Val Thr Ala Gly Asp Ile Leu Thr
100 105 110

Asp Ser Asp Ile Glu Ile Val Asn Pro Asp His Tyr Leu Phe Thr Ile
115 120 125

Gly Glu Gly Ser Ser Leu Lys Ala Thr Met Thr Val Asn Ser Gly Arg
130 135 140

Gly Tyr Val Pro Ala Asp Glu Asn Lys Lys Asp Asn Ala Pro Val Gly
145 150 155 160

Thr Leu Ala Val Asp Ser Ile Tyr Thr Pro Val Thr Lys Val Asn Tyr
165 170 175

Gln Val Glu Pro Ala Arg Val Gly Ser Asn Asp Gly Phe Asp Lys Leu
180 185 190

Thr Leu Glu Ile Leu Thr Asn Gly Thr Ile Ile Pro Glu Asp Ala Leu
195 200 205

Gly Leu Ser Ala Arg Ile Leu Thr Glu His Leu Asp Leu Phe Thr Asn
210 215 220

Leu Thr Glu Ile Ala Lys Ser Thr Glu Val Met Lys Glu Ala Asp Thr
225 230 235 240

Glu Ser Asp Asp Arg Ile Leu Asp Arg Thr Ile Glu Glu Leu Asp Leu
245 250 255

Ser Val Arg Ser Tyr Asn Cys Leu Lys Arg Ala Gly Ile Asn Thr Val
260 265 270

His Asp Leu Thr Glu Lys Ser Glu Ala Glu Met Met Lys Val Arg Asn
275 280 285

Leu Gly Arg Lys Ser Leu Glu Glu Val Lys Leu Lys Leu Ile Asp Leu
290 295 300

Gly Leu Gly Leu Lys Asp Lys
305 310



(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:

Met Asp Ile Glu Lys Gln Arg Gly Ile Ser Val Thr Ser Ser Val Met
1 5 10 15

Gln Phe Asp Tyr Asp Gly Lys Arg Val Asn Ile Leu Asp Thr Pro Gly
20 25 30

His Glu Asp Phe Ser Glu Asp Thr Tyr Arg Thr Leu Met Ala Val Asp
35 40 45

Ala Ala Val Met Val Val Asp Ser Ala Lys Gly Ile Glu Ala Gln Thr
50 55 60

Lys Lys Leu Phe Glu Val Val Lys His Arg Gly Ile Pro Val Phe Thr
65 70 75 80

Phe Met Asn Lys Leu Asp Arg Asp Gly Arg Glu Pro Leu Asp Leu Leu
85 90 95

Gln Glu Leu Glu Glu Ile Leu Gly Ile Ala Ser Tyr Pro Met Asn Trp
100 105 110

Pro Ile Gly Met Gly Lys Ala Phe Glu Gly Leu Tyr Asp Leu Tyr Asn
115 120 125

Gln Arg Leu Glu Leu Tyr Lys Gly Asp Glu Arg Phe Ala Ser Pro Arg
130 135 140

Arg Trp Arg Gln Thr Phe Trp
145 150



(2) INFORMATION FOR SEQ ID NO: 252:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 

Met Ala Val Ile Ala Glu Arg Lys Ala Tyr Tyr His Asp Pro Val Glu
1 5 10 15

Asp Ala Ile Ile Met Lys Arg Glu Ile Asp Glu Gly
20 25

(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 

Met Asn Glu Thr Trp Ser Arg Thr Ala Leu Thr Leu Leu Lys His Gly
1 5 10 15

Ser Glu Val Asn Leu Glu Arg Ala Leu Ser Val Asn Gly Arg Leu Gly
20 25 30

Gly His Val Val Thr Gly His Ile Asp Gly Thr Gly Lys Ile Ser Ser
35 40 45

Ile Lys Lys Asp Asp Asn Ala Val Trp Tyr Gln Ile Asn Thr Gln Lys
50 55 60

Glu Ile Leu Asp Leu Ile Val Glu Lys Gly Ser Ile Thr Ile Asp Gly
65 70 75 80

Ile Ser Leu Thr Val Ala Lys Val Ser Lys Val Asn Phe Ser Val Ser
85 90 95

Val Ile Pro His Thr Leu Lys Gln Thr Ile Leu Lys Ser Lys Gln Val
100 105 110

Gly Ser Thr Val Asn Leu Glu Asn Asp Ile Leu Gly Lys Tyr Val Gln
115 120 125

Lys Leu Met Asp Asn Ser Pro Lys Ser Glu Ile Ser Lys Glu Leu Leu
130 135 140

Tyr Gln Asn Gly Phe
145

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254:

Met Asn Trp Ser Ala Leu Glu Gln Gln Ile Val Asp Glu Leu Lys Ile
1 5 10 15

Tyr Ile Ala Pro Lys Ile Phe Gly Gly Ser Ala Lys Phe Pro Val Gly
20 25 30

Gly Glu Gly Ile Ser Leu Pro Asn Asp Ala Ile Arg Leu Lys Pro Tyr
35 40 45

Ala Phe Ser Gln Xaa Gly Xaa Asp Tyr Leu Ile Glu Ser Glu Val Ile
50 55 60

Tyr Pro Cys Ser Gln Glu
65 70

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

Met Val Leu Phe Leu Val Pro Gln Leu Arg Asn Ala Tyr Gly Thr Ala
1 5 10 15

Ala Ile Gly Ile Ile Cys Gly Leu Tyr Trp Ala Val Ser Ser Asn Met
20 25 30

Thr Val Glu Ala Thr Gln Arg Leu Thr Gly Gly Gly Gly Phe Ala Ile
35 40 45

Gly His Gln Gln Gln Phe Ala Ile Trp Phe Val Asp Lys Val Ala Gly
50 55 60

Arg Phe Gly Lys Lys Glu Glu Ser Leu Asp Asn Leu Lys Leu Pro Lys
65 70 75 80

Phe Leu Ser Ile Phe His Asp Thr Val Val Ala Ser Ala Thr Phe Asp
85 90 95

Ala Arg Ile Leu Arg Gly His Ser Phe Asn Leu Gly Ser Arg His Tyr
100 105 110

Val



(2) INFORMATION FOR SEQ ID NO: 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:

Met Leu Ile Asn Ser Leu Asn Gln Ser Tyr Ile Ser Ala Lys Arg Ile
1 5 10 15

Glu Glu Val Phe Ala Glu Ser Pro Glu Asn Ile His Ser Glu Leu Glu
20 25 30

Gln Lys Gln Val Thr Ser Gly Arg Val Leu Gln Val Gln Glu Leu Thr
35 40 45

Phe Thr Tyr Pro Asp Ala Ala Gln Pro Ser Leu Arg Asp Ile Ser Phe
50 55 60

Asp Met Thr Gln Gly Gln Ile Leu Gly Ile Ile Gly Gly Thr Gly Ser
65 70 75 80

Gly Lys Ser Ser Leu Val Gln Leu Leu Leu Gly Leu Tyr Pro Val Asp
85 90 95

Lys Gly Asn Ile Asp Leu Tyr Gln Asn Gly Arg Ser Pro Leu Asn Leu
100 105 110

Glu Gln Trp Arg Ser Trp Ile Ala Tyr Val Pro Gln Lys Val Lys Leu
115 120 125

Phe Lys Gly Thr Ile Arg Ser Asn Leu Thr Leu Gly Leu Asn Gln Glu
130 135 140

Val Ser Asp Gln Lys Leu Trp Gln Ala Leu Glu Ile Ala Gln Ala Lys
145 150 155 160

Asp Phe Val Ser Glu Lys Glu Gly Leu Leu Asp Ala Leu Ile Glu Ala
165 170 175

Gly Gly Arg Asn Phe Ser Gly Gly Gln Lys Gln Arg Leu Ser Ile Ala
180 185 190

Arg Ala Val Leu Arg Gln Ala Pro Phe Ile Ile Leu Asp Asp Ala Thr
195 200 205

Ser Ala Leu Asp Thr Ile Thr Glu Ser Lys Leu Leu Lys Ala Ile Arg
210 215 220

Glu Asn Phe Pro Asn Thr Ser Leu Ile Leu Ile Ser Gln Arg Thr Ser
225 230 235 240

Thr Leu Gln Met Ala Asp Gln Ile Leu Leu Leu Glu Lys Gly Glu Leu
245 250 255

Leu Ala Val Gly Lys His Asp Asp Leu Met Lys Ser Ser Gln Val Tyr
260 265 270

Arg Glu Ile Asn Ala Ser Gln His Gly Lys Glu Asp
275 280

(2) INFORMATION FOR SEQ ID NO: 257: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 105 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257:

Met Lys Asp Pro Glu Ser Arg Leu Tyr Ile Leu Leu Lys Asp Gly Gln
1 5 10 15

Val Ile Gly Thr Cys Thr Val Asp Leu Ser Thr Asn Thr Asn Tyr Phe
20 25 30

Tyr Gly Leu Ala Ile Leu Glu Pro Glu Arg Gly Lys Gly Tyr Gly Ser
35 40 45

Tyr Leu Ala Lys Ser Leu Val Asn Gln Leu Ile Glu Gln Asn Asp Lys
50 55 60

Glu Phe Gln Ile Ala Val Glu Asp Ser Asn Val Gly Ala Lys Arg Leu
65 70 75 80

Tyr Glu Lys Ile Gly Phe Val Lys Gln Thr Gln Val Val Tyr Leu Asn
85 90 95

Glu Lys Gly Ala Arg Asp Ser Glu Val
100 105

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

Met Phe Ala Leu Ser Ser Asn Lys Glu Leu Ala Glu Arg Val Ala Gln
1 5 10 15

Glu Ile Gly Ile Glu Leu Gly Lys Ser Ser Val Arg Gln Phe Ser Asp
20 25 30

Gly Glu Ile Gln Val Asn Ile Glu Glu Ser Ile Arg Gly Lys His Val
35 40 45

Phe Ile Leu Gln Ser Thr Ser Ser Pro Val Asn Asp Asn Leu Leu Glu
50 55 60

Ile Leu Ile Met Val Asp Ala Leu Lys Arg Ala Ser Ala Glu Ser Val
65 70 75 80

Asn Val Val Met Pro Tyr Tyr Gly Tyr Ala Arg Gln Asp Arg Lys Ala
85 90 95

Arg Ala Arg Glu Pro Ile Thr Ser Lys Leu Val Ala Asn Met Leu Glu
100 105 110

Val Ala Gly Val Asp Arg Leu Leu Thr Ile Asp Leu His Ala Ala Gln
115 120 125

Ile Gln Gly Phe Phe Asp Ile Pro Val Asp His Leu Met Gly Ala Pro
130 135 140

Leu Ile Ala Asp Tyr Phe Glu Arg Pro Gly Met Val Gly Ser Asp Tyr
145 150 155 160

Val Val Val Ser Pro Asp His Gly Gly Val Thr Arg Ala Arg Lys Leu
165 170 175

Ala Glu Phe Leu Lys Thr Ser Ile Ala Ile Ile Glu Lys Arg Arg Ser
180 185 190

Val Asp Lys Met Asn Thr Ser Glu Val Met Asn Thr Ile Gly Lys Val
195 200 205

Glu Gly Asn His Leu
210

(2) INFORMATION FOR SEQ ID NO: 259: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:

Met Ile Ser Phe Leu Leu Leu Leu Val Leu Val Trp Gly Phe Tyr Ile
1 5 10 15

Gly Tyr Arg Arg Gly Leu Leu Leu Gln Val Tyr Tyr Leu Ile Ser Ala
20 25 30

Met Ala Ser Ala Phe Met Ala Gly Gln Phe Tyr Lys Gly Leu Gly Glu
35 40 45

Gln Phe His Leu Leu Leu Pro Tyr Ala Asn Ser Gln Glu Gly Gln Gly
50 55 60

Thr Phe Phe Phe Pro Ser Asp Gln Leu Phe Gln Leu Asp Lys Val Phe
65 70 75 80

Tyr Ala Gly Ile Gly Tyr Leu Leu Val Phe Gly Ile Val Tyr Ser Ile
85 90 95

Gly Arg Leu Leu Gly Leu Leu Leu His Leu Ile Pro Ser Lys Lys Leu
100 105 110

Gly Gly Lys Leu Phe Gln Val Ser Ala Gly Ile Leu Ser Met Leu Val
115 120 125

Thr Leu Phe Val Leu Gln Met Ala Leu Thr Ile Leu Ala Thr Ile Pro
130 135 140

Met Ala Val Ile Gln Asn Pro Leu Glu Lys Ser Ile Val Ala Lys His
145 150 155 160

Ile Ile Gln Ser Ile Pro Ile Thr Thr Ser Trp Leu Lys Gln Ile Trp
165 170 175

Val Thr Asn Leu Ile Gly
180



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 160 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 



Met Arg Ala Phe Ser Asp Phe Gly Thr Pro Met Leu Ile Gly Glu Gly
1 5 10 15

Tyr Arg Thr Phe Pro Val Leu Ile Tyr Thr Gln Phe Ile Ser Glu Val
20 25 30

Gly Gly Asn Ser Ala Phe Ala Ile Met Ala Ile Ile Ile Ala Leu Ala
35 40 45

Ile Phe Leu Ile Gln Lys His Ile Ala Asn Arg Tyr Ser Phe Ser Met
50 55 60

Asn Leu Leu His Pro Ile Glu Pro Lys Lys Thr Thr Lys Gly Lys Met
65 70 75 80

Ala Ala Ile Tyr Ala Thr Val Tyr Gly Ile Ile Phe Ile Ser Val Leu
85 90 95

Pro Gln Ile Tyr Leu Ile Tyr Thr Ser Phe Leu Lys Thr Ser Gly Met
100 105 110

Val Phe Val Lys Gly Tyr Ser Pro Asn Ser Tyr Lys Val Ala Phe Asn
115 120 125

Arg Met Gly Ser Ala Ile Phe Asn Thr Ile Arg Ile Pro Leu Ile Ala
130 135 140

Leu Val Leu Val Val Pro Ile Tyr Asp Ile Tyr Leu Leu Pro Ser Arg
145 150 155 160



(2) INFORMATION FOR SEQ ID NO:261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

Met Val Pro Tyr Ile Val Pro Gly Thr Val Leu Gly Ile Ala Phe Ile
1 5 10 15

Ser Ser Phe Asn Thr Gly Leu Phe Gly Ser Gly Phe Leu Met Ile Thr
20 25 30

Gly Thr Ala Phe Ile Leu Ile Met Ser Leu Ser Val Arg Arg Leu Pro
35 40 45

Tyr Thr Ile Arg Ser Ser Val Ala Ser Leu Gln Gln Ile Ala Pro Ser
50 55 60

Ile Glu Glu Ala Ala Gly Lys Leu Arg Lys
65 70

(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

Met Ile Phe Ile Pro Met Ala Ala Tyr Ser Ile Ala Arg Asn Met Ser
1 5 10 15

Lys Arg Lys Ala Phe Thr Ile His Val Tyr Pro Leu Asn Pro Arg Asn
20 25 30

Leu Arg Thr Phe Pro Ser His His Asp Ser Asp Tyr Gly Tyr Asp Glu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 



Met Lys Phe Arg Lys Leu Ala Cys Thr Val Leu Ala Gly Ala Ala Val
1 5 10 15

Leu Gly Leu Ala Ala Cys Gly Asn Ser Gly Gly Ser Lys Asp Ala Ala
20 25 30

Lys Ser Gly Gly Asp Gly Ala Lys Thr Glu Ile Thr Trp Trp Ala Phe
35 40 45

Pro Val Phe Thr Gln Glu Lys Thr Gly Asp Gly Val Gly Thr Tyr Glu
50 55 60

Lys Ser Ile Ile Gln Ala Phe
65 70



(2) INFORMATION FOR SEQ ID NO: 264:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 275 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264:

Met Asn Ser Gln Gln Lys Pro Gln Glu Ile Lys Val Glu Glu Pro Val
1 5 10 15

Glu Ser Lys Glu Glu Thr Val Asn Gln Pro Val Glu Gln Pro Lys Val
20 25 30

Glu Thr Pro Ala Val Glu Lys Gln Thr Glu Pro Thr Glu Glu Pro Lys
35 40 45

Val Glu Val Thr Ser Ile Pro Gln Thr Thr Arg Tyr Glu Glu Asp Leu
50 55 60

Thr Lys Glu His Gly Thr Arg Glu Val Val Lys Glu Gly Lys Asn Gly
65 70 75 80

Ser Arg Thr Val Thr Thr Pro Tyr Ile Leu Asn Ala Thr Asp Gly Thr
85 90 95

Thr Thr Glu Gly Thr Ser Thr Thr Asp Glu Ala Glu Met Glu Lys Glu
100 105 110

Val Val Arg Val Gly Thr Lys Pro Lys Glu Lys Leu Ala Pro Val Leu
115 120 125

Ser Leu Thr Ser Val Thr Asp Asn Ala Met Leu Arg Ser Ala Arg Leu
130 135 140

Thr Tyr His Leu Glu Asn Thr Asp Ser Val Asp Val Lys Lys Ile His
145 150 155 160

Ala Glu Ile Lys Asn Gly Asp Lys Val Val Lys Thr Ile Asp Leu Ser
165 170 175

Lys Glu Arg Leu Ser Asp Ala Val Asp Gly Leu Glu Leu Tyr Lys Asp
180 185 190

Tyr Lys Ile Val Thr Ser Met Thr Tyr Asp Arg Gly Asn Gly Glu Glu
195 200 205

Thr Ser Thr Leu Glu Glu Thr Pro Leu Arg Leu Asp Leu Lys Lys Val
210 215 220

Glu Leu Lys Asn Ile Gly Ser Thr Asn Leu Val Lys Val Asn Glu Asp
225 230 235 240

Gly Thr Glu Val Ala Ser Asp Phe Leu Thr Ser Lys Pro Val Asp Val
245 250 255

Gln Asn Tyr Tyr Leu Lys Val Thr Ser Arg Asp Asn Lys Val Val Ser
260 265 270

Pro Pro Ser
275

(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:

Met Ala Val Met Lys Ile Glu Tyr Tyr Ser Gln Val Leu Asp Met Glu
1 5 10 15

Trp Gly Val Asn Val Leu Tyr Pro Asp Ala Asn Arg Val Glu Glu Pro
20 25 30

Glu Cys Glu Asp Ile Pro Val Leu Tyr Leu Leu His Gly Met Ser Gly
35 40 45

Asn His Asn Ser Trp Leu Lys Arg Thr Asn Val Glu Arg Leu Leu Arg
50 55 60

Gly Thr Asn Leu Ile Val Val Met Pro Asn Thr Ser Asn Gly Trp Tyr
65 70 75 80

Thr Asp Thr Gln Tyr Gly Phe Asp Tyr Tyr Thr Ala Leu Ala Glu Glu
85 90 95

Leu Pro Gln Val Pro Glu Thr Leu Leu Pro
100 105

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:

Met Arg Leu Leu Ala Met Lys Met Lys Gln Ile Ser Asp Thr Thr Leu
1 5 10 15

Lys Ile Thr Met Ser Leu Glu Asp Leu Met Asp Arg Gly Met Glu Ile
20 25 30

Ala Asp Phe Leu Val Pro Gln Glu Lys Thr Glu Glu Phe Phe Tyr Ala
35 40 45

Ile Leu Gly
50

(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

Met Leu Pro Ile Ile Cys Cys Gly Glu Ser Leu Glu Thr Tyr Glu Ala
1 5 10 15

Gly Lys Ala Ala Glu Phe Val Gly Ala Gln Val Ser Ala Ala Leu Ala
20 25 30

Gly Leu Thr Ala Glu Gln Val Ala Ala Ser Val Ile Ala Tyr Glu Pro
35 40 45

Ile Trp Ala Ile Gly Thr Gly Lys Ser Ala Ser Gln Asp Asp Ala Gln
50 55 60

Lys Met Cys Lys Val Val Arg Asp Val Val Ala Ala Asp Phe Gly Gln
65 70 75 80

Glu Val Ala Asp Lys Val Arg Val Gln Tyr Gly Gly Ser Val Lys Pro
85 90 95

Glu Asn Val Ala Ser Tyr Met Ala Cys Pro Asp Val Asp Gly Ala Leu
100 105 110

Val Gly Gly Ala Ser Leu Glu Ala Glu Ser Phe Leu Ala Leu Leu Asp
115 120 125

Phe Val Lys
130



(2) INFORMATION FOR SEQ ID NO: 268: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 

Met Phe Val Glu Ala Cys Lys Ala Val Val Arg Ala Asn Glu Glu Tyr
1 5 10 15

Val Pro Pro Tyr Gly Ile Gly Gly Thr Leu Tyr Leu Arg Pro Leu Leu
20 25 30

Ile Gly Val Gly Asp Ile Ile Gly Val Lys Pro Ala Glu Glu Tyr Ile
35 40 45

Phe Thr Ile Phe Ala Met Pro Val Gly Asn Tyr Phe Lys Gly Gly Leu
50 55 60

Val Pro Thr Asn Phe Leu Ile Gln Asp Glu Tyr Asp Arg Ala Ala Pro
65 70 75 80

Asn Gly Thr Gly Ala Ala Lys Val Gly Gly Asn Tyr Ala Ala Ser Leu
85 90 95

Leu Pro Gly Lys Met Ala Lys Ser Arg His Phe Ser Asp Val Ile Tyr
100 105 110

Leu Asp Pro Ser Thr His Thr Lys Ile Glu Glu Val Gly Ser Ala Asn
115 120 125

Phe Phe Gly Ile Thr Ala Asp Asn Glu Phe Val Thr Pro Leu Ser Pro
130 135 140

Ser Ile Leu Pro Ser Ile Thr Lys Tyr Ser Leu Leu Tyr Leu Ala Glu
145 150 155 160

His Arg Leu Gly Leu Thr Pro Ile Glu Gly Asp Val Pro Ile Asp Asn
165 170 175

Leu Asp Arg Phe Val Lys Ala Gly Ala Cys Gly Thr Ala Ala Val Ile
180 185 190

Ser Pro Ile Gly Gly Ile Gln His Gly Asp Asp Phe His Val Ile Leu
195 200 205



(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 



Met Val Met Ile Ser Met Leu Phe Tyr Ser Glu Thr Glu Val Gly Pro

1 5 10 15

Val Thr Arg Lys Leu Tyr Asn Glu Leu Thr Gly Ile Gln Phe Gly Asp

20 25 30

Ile Glu Ala Pro Glu Gly Trp Ile Val Lys Val Asp
35 40 



(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270: 

Met Thr Ala Ser Pro Leu Lys Lys Ser Ile Lys Lys Lys Asn Arg Lys

1 5 10 15

Leu Thr Asn Lys Xaa Glu Lys His Met Ser Lys Ile Ile Gly Ile Asp

20 25 30

Leu Gly Thr Thr Asn Ser Ala Val Ala Val Leu Glu Gly Thr Glu Ser

35 40 45

Lys Ile Ile Ala Asn Pro Glu Gly Asn Arg Thr Thr Pro Ser Val Val

50 55 60

Ser Phe Lys Asn Gly Glu Ile Ile Val Gly Asp Ala Ala Lys Arg Gln
65 70 75 80

Ala Val Thr Asn Pro Asp Thr Val Ile Ser Ile Lys Ser Lys Met Gly

85 90 95

Thr Ser Glu Lys Val Ser Ala Asn Gly Lys Glu Tyr Thr Pro Gln Glu

100 105 110

Ile Ser Ala Met Ile Leu Gln Tyr Leu Lys Gly Tyr Ala Glu Asp Tyr

115 120 125

Leu Gly Glu Lys Val Thr Lys Ala Val Ile Thr Val Pro Ala Tyr Phe

130 135 140

Asn Asp Ala Gln Arg Gln Ala Thr Lys Asp Ala Gly Lys Ile Ala Gly
145 150 155 160

Leu Glu Val Xaa Arg Ile Val Asn Glu Pro Thr Ala Ala Ala Leu Ala

165 170 175

Tyr Gly Leu Asp Lys Thr Asp Lys Glu Glu Lys Ile Leu Val Phe Asp

180 185 190

Leu Gly Gly Gly Thr Phe Asp Val Ser Ile Leu Glu Leu Gly Asp Gly

195 200 205

Val Phe Asp Val Leu Ser Thr Ala Gly Asp Asn Lys Leu Gly Gly Asp

210 215 220

Asp Phe Asp Gln Lys Ile Ile Asp His Leu Val Ala Glu Phe Lys Lys
225 230 235 240

Glu Asn Gly Ile Arg Leu Val Tyr
245 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

Met Lys Leu Ile Val Asp Leu Ile Tyr Glu Gly Gly Phe Lys Lys Met

1 5 10 15

Arg Gln Ser Ile Ser Asn Thr Ala Glu Tyr Gly Asp Tyr Val Ser Gly

20 25 30

Pro Arg Val Ile Thr Glu Gln Val Lys Glu Asn Met Lys Ala Val Leu

35 40 45

Ala Asp Ile Gln Asn Gly Lys Phe Ala Asn Asp Phe Val Asn Asp Tyr

50 55 60

Lys Ala Gly Arg Pro Lys Leu Thr Ala Tyr Arg Glu Gln Ala Ala Asn
65 70 75 80

Leu Glu Ile Glu Lys Val Gly Ala Glu Leu Arg Lys Ala Met Pro Phe

85 90 95

Val Gly Lys Asn Asp Asp Asp Ala Phe Lys Ile Tyr Asn
100 105 

(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

Met Leu Leu Ser Ala Phe His Lys Tyr Glu Asn Gln Leu Asp Lys Val

1 5 10 15

Arg Phe Ile Gly Leu His Thr Gly His Leu Gly Phe Tyr Thr Asp Tyr

20 25 30

Arg Asp Phe Glu Leu Asp Lys Leu Val Thr Asn Leu Gln Leu Asp Thr

35 40 45

Gly Ala Arg Val Ser Tyr Pro Val Leu Asn Val Lys Val Phe Leu Glu

50 55 60

Asn Gly Glu Val Lys Ile Phe Arg Ala Leu Asn Glu Ala Ser Ile Arg
65 70 75 80

Arg Ser Asp Arg Thr Met Val Ala Asp Ile Val Ile Asn Gly Val Pro

85 90 95

Phe Glu Arg Phe Arg Gly Asp Gly Leu Thr Val Ser Thr Pro Thr Gly

100 105 110

Ser Thr Ala Tyr Asn
115 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273:

Met Gly Ala Gln Leu Ala Arg Glu Phe Lys His Glu Ala Asp Ile Val

1 5 10 15

Val Gly Val Pro Asn Ser Ser Leu Ser Ala Ala Met Gly Phe Ala Glu

20 25 30

Glu Ser Gly Leu Pro Asn Glu Met Gly Leu Ile Lys Asn Gln Tyr Thr

35 40 45

Gln Arg Thr Phe Ile Gln Pro Thr Gln Glu Leu Arg Glu Gln Gly Val

50 55 60

Arg Met Lys Leu Ser Ala Val Ser Gly Val Val Lys Gly Lys Arg Val
65 70 75 80

Val Met Val Asp Asp Ser Ile Val Arg Gly Thr Thr Ser Arg Arg Ile

85 90 95

Val Gln Leu Leu Lys Glu Ala Gly Ala Thr Glu Val His Val Ala Ile

100 105 110

Gly Ser Pro Ala Leu Ala Tyr Pro Cys Phe Tyr Gly Ile Asp Ile Gln

115 120 125

Thr Arg Gln Glu Leu Ile Ala Ala Asn His Thr Val Glu Glu Thr Arg

130 135 140

Gln Ile Ile Gly Ala Asp Ser Leu Thr Tyr Leu Ser Ile Asp Ser Leu
145 150 155 160 

Ile Glu Ser Ile Gly Ile Glu Thr Asp Ala Pro Asn Gly Gly Leu Cys

165 170 175

Val Ala Tyr Phe Asp Gly Asp Tyr Pro Thr Pro Leu Tyr Asp Tyr Glu

180 185 190

Glu Asp Tyr Arg Arg Ser Leu Glu Glu Lys Thr Ser Phe Tyr Lys
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 274: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 



Met Lys Ile Leu Val Thr Gly Phe Asn Pro Phe Gly Gly Glu Lys Ile

1 5 10 15

Asn Pro Ala Leu Glu Ala Val Lys Leu Leu Pro Ser Glu Ile Asn Gly

20 25 30

Ala Glu Val Arg Trp Val Glu Ile Pro Thr Val Phe Tyr Lys Ser Ser

35 40 45

Glu Val Leu Glu Ala Glu Ile Leu Arg Tyr Gln Pro Asp Ala Val Leu

50 55 60

Cys Ile Gly Gln Ala Gly Gly Arg Thr Gly Leu Thr Pro Glu Arg Val
65 70 75 80

Ala Ile Asn Gln Asp Asp Ala Arg Ile Pro Asp Asn Glu Gly Asn Gln

85 90 95

Pro Ile Asp Thr Pro Ile Arg Ile Asp Gly Ala Ser Ala Tyr Phe Ser

100 105 110

Ser Leu Pro Ile Lys Ala Met Val Gln Ala Asn Lys Lys Glu Gly Leu

115 120 125

Thr Gly Ser Leu Phe Pro Ile Arg Ala Gly Thr Phe Val Cys Ser His

130 135 140

Leu Met Tyr Gln Ala Leu Tyr Leu Val Glu Lys Lys Phe Pro Tyr Val
145 150 155 160

Lys Ala Gly Phe Met His Ile Pro Tyr Met Met Glu Gln Val Val Asn

165 170 175

Arg Pro Thr Thr Pro Thr Met Ser Leu Val Asp Ile Arg Arg Gly Ile

180 185 190

Glu Ala Ala Ile Gly Ala Met Ile Glu His Gly Asp Gln Glu Leu Thr

195 200 205

Leu Val Gly Gly Glu Ile His
210 215 

(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:

Met Leu Val Pro Lys Arg Val Lys His Arg Arg Glu Phe Arg Gly Lys

1 5 10 15

Met Arg Gly Glu Ala Lys Gly Gly Lys Glu Val Ala Phe Gly Glu Tyr

20 25 30

Gly Leu Gln Ala Thr Thr Ser His Trp Ile Thr Asn Arg Gln Ile Glu

35 40 45

Ala Ala Arg Ile Ala Met Thr Arg Tyr Met Lys Arg Gly Gly Lys Val

50 55 60

Trp Ile Lys Ile Phe Pro His Lys Ser Tyr Thr Ala Lys Ala Ile Gly
65 70 75 80

Val Arg Met Gly Ser Gly Lys Gly Ala Pro Glu Gly Trp Val Ala Pro

85 90 95

Val Lys Arg Gly Lys Val Met Phe Glu Ile Ala Gly Val Ser Glu Glu

100 105 110

Ile Ala Arg Glu Ala Leu Arg Leu Ala Ser His Lys Leu Pro Val
115 120 125 

(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

Met Gly Trp Trp Arg Glu Thr Ile Asp Ile Val Lys Glu Asn Asp Pro
1 5 10 15

Ala Ala Arg Thr Thr Leu Glu Val Leu Leu Thr Tyr Pro Gly Val Lys

20 25 30

Ala Leu Ala Ala His Arg Leu Ser His Phe Leu Trp Lys Tyr Asp Phe

35 40 45

Lys Leu Leu Ala Arg Met His Ser Gln Phe Trp Arg Phe Trp Thr Gln

50 55 60

Ile Glu Ile His Pro Gly Ala Gln Ile Asp Ser Gly Val Phe Ile Asp
65 70 75 80

His Gly Ser Gly Leu Val Ile Gly Glu Thr Ala Ile Val Glu Lys Gly

85 90 95

Val Leu Leu Tyr His Gly Val Thr Leu Gly Gly Thr Gly Lys Asp Cys

100 105 110

Gly Lys Arg His Pro Thr Val Arg Lys Gly Ala Leu Ile Ser Ala His

115 120 125

Ala Gln Val Ile Gly Pro Val Glu Ile Gly Glu Asn Ala Lys Val Gly

130 135 140

Ala Ala Ala Val Val Val Ala Asp Val Pro Ser Asp Val Thr Val Val
145 150 155 160

Gly Ile Pro Ala Lys Ile Val Arg Leu His Gly Lys Lys Asp Glu Pro

165 170 175

Val Ile His Glu Val Glu Glu Lys Arg Glu Tyr Tyr Val Asn Lys Leu

180 185 190

Glu Gln Ala Lys Asp Ala Ser His Arg Ser Ser Gly Leu
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Met Leu Phe Tyr Leu Leu Arg Asp Gly Lys Gly Leu Arg Asn Tyr Leu 

1 5 10 15

Thr Gln Phe Ile Pro Ser Lys Leu Lys Glu Pro Val Gly Gln Val Leu

20 25 30

Ser Asp Val Asn Gln Gln Leu Ser Asn Tyr Val Arg Gly Gln Val Thr
35 40 45

Val Ala Ile Ile Val Ala Val Met Phe Ile Ile Phe Phe Lys Ile Ile

50 55 60

Gly Leu Arg Tyr Ala Val Thr Leu Gly Val Thr Ala Gly Ile Leu Asn
65 70 75 80

Leu Val Pro Tyr Leu Gly Ser Phe Leu Ala Met Leu Pro Ala Leu Val

85 90 95

Leu Gly Leu Ile Ala Gly Pro Val Met Leu Leu Lys Val Val Ile Val

100 105 110

Phe Ile Val Glu Gln Thr Ile Glu Gly Arg Phe Val Ser Pro Leu Ile

115 120 125

Leu Gly Ser Gln Leu Asn Ile His Pro Ile Asn Val Leu Phe Val Leu

130 135 140

Leu Thr Ser Gly Ser Met Phe Gly Ile Trp Gly Val Leu Leu Gly Ile
145 150 155 160

Pro Val Tyr Ala Ser Ala Lys Val Val Ile Ser Ala Ile Phe Glu Trp

165 170 175

Tyr Lys Val Val Ser Gly Leu Tyr Glu Leu Glu Gly Glu Glu Val Lys
180 185 190

Ser Glu Gln
195 



(2) INFORMATION FOR SEQ ID NO:278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 



Met Thr Lys Gln Val Pro Lys Phe Thr Lys Asp Thr Ala Gln Leu Tyr

1 5 10 15

Thr Cys Lys Trp Leu Leu Tyr Asn Lys Val Thr Lys Met Tyr Asp His

20 25 30

Thr Val Val Asn His Ser Val Arg Glu Tyr Ile Thr Asp Ser Ile Ser

35 40 45

Thr Asn Thr Ile Glu Glu Thr Gly
50 55



(2) INFORMATION FOR SEQ ID NO:279: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 



Met Lys Lys Arg Ala Ile Val Ala Val Ile Val Leu Leu Leu Ile Gly

1 5 10 15

Leu Asp Gln Leu Val Lys Ser Tyr Ile Val Gln Gln Ile Pro Leu Gly

20 25 30

Glu Val Arg Ser Trp Ile Pro Asn Phe Val Ser Leu Thr Tyr Leu Gln

35 40 45

Asn Arg Gly Ala Ala Phe Ser Ile Leu Gln Asp Gln Gln Leu Leu Phe

50 55 60

Ala Val Ile Thr Leu Val Val Val Ile Gly Ala Ile Trp Tyr Leu His
65 70 75 80

Lys His Met Glu Asp Ser Phe Trp Met Val Leu Gly Leu Thr Leu Ile

85 90 95

Ile Ala Gly Gly Leu Gly Asn Phe Ile Asp Arg Val Ser Gln Gly Phe

100 105 110

Val Val Asp Met Phe His Leu Asp Phe Ile Asn Phe Ala Ile Phe Asn

115 120 125

Val Ala Asp Asn Tyr Leu Thr Val Gly Val Ile Ile Leu Leu Ile Ala

130 135 140

Met Leu Lys Glu Glu Ile Asn Gly Asn
145 150 



(2) INFORMATION FOR SEQ ID NO:280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 

Met Gly Ala Tyr Tyr Leu Val Met Gln Ser Leu Ser Tyr Leu Glu Tyr

1 5 10 15

Glu Gln Gly Ile Gln Ser Thr Thr Val Arg His Leu Ile Leu Val Phe

20 25 30

Tyr Leu Leu Phe Phe Met Gly Gly Ile Lys Lys Leu Asp Thr Tyr Leu

35 40 45

Lys Glu Lys Leu Gln Glu Glu Leu Asn Gln Glu Gln Thr Leu Arg Tyr

50 55 60

Arg Asp Met Glu Arg Tyr Ser Arg His Ile Glu Glu Leu Tyr Lys Glu
65 70 75 80

Ile Arg Ser Phe Arg His Asp Tyr Thr Asn Leu Leu Thr Thr Tyr Val

85 90 95

Trp Ala Leu Lys Arg Arg Ile Trp Ser Arg
100 105 



(2) INFORMATION FOR SEQ ID NO: 281: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



Met His Val Arg Met Ile Pro Lys Ser Thr Pro Asp Thr Lys Phe Ala

1 5 10 15

Asp Val Ala Thr His Gln Pro Glu Tyr Ser Arg Asp Asn Val Ala Gly

20 25 30

Thr Ile Val Gly Phe Trp Thr Pro Glu Ile Phe His Gly Val Ser Val

35 40 45

Ala Gly Tyr His Leu His Phe Ile Ser Asp Asp Leu Thr Phe Gly Gly

50 55 60

His Val Met Asp Phe Val Ile Lys Glu Gly Ile Ile Glu Val Gly Ala
65 70 75 80

Val Asp Gln Leu Asp Gln Arg Phe Pro Val Gln Asp Arg Gln Tyr Leu

85 90 95

Phe Ala Lys Phe Asn Val Asp Glu Met Lys Lys Asp Ile Glu Lys Ala
100 105 110 

Glu 

(2) INFORMATION FOR SEQ ID NO: 282:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 



Met Ala Val Ala Lys Gly Lys Leu Thr Ile Ile Ala His Val Ala Cys

1 5 10 15

Asn Asn Thr Lys Asp Ser Met Glu Leu Ala Arg His Ala Glu Ser Leu

20 25 30

Gly Val Asp Ala Ile Ala Thr Asp Ser Thr Asn Leu Phe Pro Leu Ala

35 40 45

Arg Ile Thr Gln Leu Pro Asn Thr Gly Thr Ile Ser Val Leu Gln Leu

50 55 60

Gln Thr Gln Thr Thr
65 

(2) INFORMATION FOR SEQ ID NO: 283:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

Met Gly Glu Thr Gln Ile Ile Asp Gly Leu Asp Pro Glu Tyr Lys Lys

1 5 10 15

Arg Phe Met His His Tyr Asn Phe Pro Gln Tyr Ser Val Gly Glu Thr

20 25 30

Gly Arg Tyr Gly Ala Pro Gly Arg Arg Glu Ile Gly His Gly Ala Leu

35 40 45

Gly Glu Arg Ala Leu Ala Gln Val Leu Pro Ser Leu Glu Glu Phe Pro
50 55 60

Tyr Ala Ile Arg Leu Val Ala Glu Val Leu Glu Ser Asn Gly Ser Ser
65 70 75 80

Ser Gln Ala Ser Ile Cys Ala Gly Thr Leu Ala Leu Met Ala Gly Gly

85 90 95

Val Pro Ile Lys Ala Pro Val Ala Gly Ile Ala Met Gly Leu Ile Ser

100 105 110

Asp Gly Asn Asn Tyr Thr Val Leu Thr Asp Ile Gln Gly Leu Glu Asp

115 120 125

His Phe Gly Asp Met Asp Phe Lys Val Ala Gly Thr Arg Asp Gly Ile

130 135 140

Thr Ala Leu Gln Met Asp Ile Lys Ile Gln Gly Ile Thr Ala Glu Ile
145 150 155 160

Leu Thr Glu Ala Leu Ala Gln Ala Lys Lys Ala Arg Phe Glu Ile Leu

165 170 175

Asp Val Ile Glu Ala Thr Ile Pro Glu Val Arg Pro Glu Leu Ala Pro

180 185 190

Thr Ala Pro Lys Ile Asp Thr Ile Lys Ile Asp Val Asp Lys Ile Lys

195 200 205

Ile Val Ile Gly Lys Gly Gly Glu Thr Ile Asp Lys Ile Ile Ala Glu

210 215 220

Thr Gly Val Lys Ile Asp Ile Asp Glu Glu Xaa Asn Val Phe Tyr Leu
225 230 235 240

Leu Leu Val Asp Gln Asn Ala Ile Asn Pro Cys Pro Lys Lys Leu Leu

245 250 255 

Leu Val Trp Phe Arg Glu Pro Lys Trp Met Lys Phe Thr Val Leu Asn 

260 265 270 

Arg Ser Tyr Arg Glu Phe Gly Ala Phe Val Thr Leu 
275 280 

(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

Met Ile Asn Asn Val Val Leu Val Gly Arg Met Thr Arg Asp Ala Glu
1 5 10 15

Leu Arg Tyr Thr Pro Ser Asn Val Ala Val Ala Thr Phe Thr Leu Ala

20 25 30

Val Asn Arg Thr Phe Lys Ser Gln Asn Gly Glu Arg Glu Ala Asp Phe

35 40 45

Ile Asn Val Val Met Trp Arg Gln Gln Ala Glu Asn Leu Ala Asn Trp

50 55 60

Ala Lys Lys Gly Ser Leu Ile Gly Val Thr Gly Arg Ile Gln Thr Arg
65 70 75 80

Ser Tyr Asp Asn Gln Gln Gly Gln Arg Val Tyr Val Thr Glu Val Val

85 90 95

Ala Glu Asn Phe Gln Met Leu Glu Ser Arg Ser Val Arg Glu Gly His

100 105 110

Thr Gly Gly Ala Tyr Ser Ala Pro Thr Ala Asn Tyr Ser Ala Pro Thr

115 120 125

Asn Ser Val Pro Asp Phe Ser Arg Asn Glu Asn Pro Phe Gly Ala Thr

130 135 140

Asn Pro Leu Asp Ile Ser Xaa Asp Asp Leu Pro Phe
145 150 155 

(2) INFORMATION FOR SEQ ID NO:285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:

Met Asp Leu Val Arg Ile Leu Lys Glu Pro Arg Asn Ala Leu Val Lys

1 5 10 15

Gln Tyr Gln Thr Leu Leu Ser Tyr Asp Asp Val Glu Leu Glu Phe Asp

20 25 30

Asp Glu Ala Leu Gln Glu Ile Ala Asn Lys Ala Ile Glu Arg Lys Thr

35 40 45

Gly Ala Arg Gly Leu Arg Ser Ile Ile Glu Glu Thr Met Leu Asp Val

50 55 60

Met Phe Glu Val Pro Ser Gln Glu Asn Val Lys Leu Val Arg Ile Thr
65 70 75 80

Lys Glu Thr Val Asp Gly Thr Asp Lys Pro Ile Leu Glu Thr Ala
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 286:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

Met Thr Glu Ala Gly Tyr Val Gly Glu Asp Val Glu Asn Ile Leu Leu

1 5 10 15

Lys Leu Leu Gln Val Ala Asp Phe Asn Ile Glu Arg Ala Glu Arg Gly

20 25 30

Ile Ile Tyr Val Asp Glu Ile Asp Lys Ile Ala Lys Lys Ser Glu Asn

35 40 45

Val Ser Ile Thr Arg Asp Val Ser Gly Glu Gly Val Gln Gln Ala Leu

50 55 60

Leu Lys Ile Ile Glu Gly Thr Val Ala Ser Val Pro Pro Gln Gly Gly
65 70 75 80

Arg Lys His Pro Gln Gln Val Asp Asp Ser Ser Gly Tyr Lys Lys Tyr

85 90 95 

Pro Leu His Arg Gly Trp Cys Phe 
100 

(2) INFORMATION FOR SEQ ID NO: 287:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

Met Trp Pro Arg Glu Ser Arg Ser Cys Arg Leu Thr Lys Val Lys Ile
1 5 10 15

Cys Gly Leu Ser Thr Lys Glu Ala Val Glu Thr Ala Val Ser Ala Gly

20 25 30

Ala Asp Tyr Ile Gly Phe Val Phe Ala Pro Ser Lys Arg Gln Val Thr

35 40 45

Leu Glu Glu Ala Ala Glu Leu Ala Lys Leu Ile Pro Ala Asp Val Lys

50 55 60

Lys Val Gly Val Phe Val Ser Pro Ser Arg Val Glu Leu Leu Glu Ala
65 70 75 80

Ile Asp Lys Val Gly Leu Asp Leu Val Gln Val His Gly Gln Val Ala

85 90 95

Asp Asp Leu Phe Glu Asn Leu Pro Cys Ala Ser Ile Gln Ala Val Gln

100 105 110

Val Asp Gly Asn Gly His Val Pro Asn Ser Gln Ala Asp Tyr Leu Leu

115 120 125

Phe Asp Ala Pro Val Ala Gly Ser Gly Gln Ser Phe Asp Trp Gly Gln

130 135 140

Leu Asp Thr Thr Gly Leu Ala Gln Pro Phe Phe Ile Ala Gly Gly Leu
145 150 155 160

Asn Glu Asp Asn Val Val Lys Ala Ile Gln His Phe Thr Pro Tyr Ala

165 170 175

Val Asp Val Ser Ser Gly Val Glu Thr Asp Gly Gln Lys Asp His Glu

180 185 190

Lys Ile Arg Arg Phe Ile Glu Arg Val Lys His Gly Ile Ser Gly Thr
195 200 205

Lys 



(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:

Met Asn Cys Glu Ala Val Ala Leu Gly Ser Phe Cys Glu Leu Lys Ser 

1 5 10 15

Arg Arg Gly Lys Lys Gln Lys Gly Glu Ile Leu Met Ala Val Ile Ser
20 25 30

Met Lys Gln Leu Leu Glu Ala Gly Val His Phe Gly His Gln Thr Arg

35 40 45

Arg Trp Asn Pro Lys Met Ala Lys Tyr Ile Phe Thr Glu Arg Asn Gly

50 55 60

Ile His Val Ile Asp Leu Gln Gln Thr Val Lys Tyr Ala Asp Gln Ala
65 70 75 80

Tyr Xaa Phe Met Arg Asp Ala Ala Ala Asn Asp Ala Val Val Leu Phe

85 90 95 

Val Gly Thr Lys Lys Thr Ser Ser 
100 

(2) INFORMATION FOR SEQ ID NO:289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

Met Val Asp Thr Asn Thr Asp Pro Asp Asp Ile Asp Val Ile Ile Pro

1 5 10 15

Ala Asn Asp Asp Ala Ile Arg Ala Val Lys Leu Ile Thr Ala Lys Leu

20 25 30

Ala Asp Ala Ile Ile Glu Gly Arg Gln Gly Glu Asp Ala Val Ala Val
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Met Glu Glu Leu Gly Gln Ser Tyr Gly Tyr Leu Leu Tyr Arg Thr Glu
1 5 10 15

Thr Asn Trp Asp Ala Glu Glu Glu Arg Leu Arg Ile Ile Asp Gly Arg

20 25 30

Asp Arg Ala Gln Leu Tyr Val Asp Gly Gln Trp Val Lys Thr Gln Tyr

35 40 45

Gln Thr Glu Ile Gly Glu Asp Ile Phe Tyr Gln Gly Lys Lys Lys Gly

50 55 60

Leu Ser Arg Leu Asp Ile Leu Ile Glu Asn Met Gly Arg Val Asn Tyr
65 70 75 80

Gly His Lys Phe Leu Ala Asp Thr Gln Arg Lys Gly Ile Arg Thr Gly

85 90 95

Val Cys Lys Asp Leu His Phe Leu Leu Asn Trp Lys His Tyr Pro Leu

100 105 110

Pro Leu Asp Asn Pro Glu Lys Ile Asp Phe Ser Lys Gly Trp Thr Gln

115 120 125

Gly Gln Pro Ala Phe Tyr Ala Tyr Asp Phe Thr Val Glu
130 135 140 

(2) INFORMATION FOR SEQ ID NO:291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:291: 

Met Gly Lys Glu Lys Val Trp Leu Pro Arg Lys Gln Gly Leu Pro Gly

1 5 10 15

Leu Ala Tyr Tyr Val Ile Glu Val Ala His Lys Glu Glu Leu Leu Thr

20 25 30

Ile Ala Gln Arg Ala Gln Glu Val Asp Val Pro Ile Lys Trp Met Thr

35 40 45

Ser Ser Gln Leu Glu Ile Thr Asp Ser Asp Gly Ile Val Thr Cys Ile

50 55 60 

Arg Leu Ala Arg 
65 

(2) INFORMATION FOR SEQ ID NO:292: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

Met Asn Lys Lys Gln Trp Leu Gly Leu Gly Leu Val Ala Val Ala Ala

1 5 10 15

Val Gly Leu Ala Ala Cys Gly Asn Arg Ser Ser Arg Asn Ala Ala Ser

20 25 30

Ser Ser Asp Val Lys Thr Lys Ala Ala Ile Val Thr Asp Thr Gly Gly

35 40 45

Val Asp Asp Lys Ser Phe Asn Gln Ser Ala Trp Glu Val Ala Gly Leu
50 55 60 

Gly 
65 

(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

Met Ser Gln Trp Asp Arg Lys Leu Asp Ala Arg Leu Ala Gln Ala Val

1 5 10 15

Val Ser Ile Asn Ala Phe Lys Gly Val Glu Phe Gly Leu Gly Phe Glu

20 25 30

Ala Gly Tyr Arg Lys Gly Ser Gln Val Met Asp Glu Ile Leu Trp Ser

35 40 45

Lys Glu Asp Gly Tyr Thr Arg Arg Thr Asn Asn Leu Gly Gly Phe Glu

50 55 60

Gly Gly Met Thr Asn Gly Gln Pro Ile Val Val Arg Gly Val Met Lys
65 70 75 80

Pro Ile Pro Thr Leu Tyr Lys Pro Leu Met Ser Val Asp Ile Glu Thr
85 90 95

His Glu Pro Tyr Lys Ala Thr Val Glu Arg Ser Asp Pro Thr Ala Leu

100 105 110

Pro Ala Ala Gly Met Val Met Glu Ala Val Val Ala Thr Val Leu Ala

115 120 125

Gln Glu Ile Leu Glu Lys Phe Ser Ser Asp Asn Leu Glu Glu Leu Lys

130 135 140 

Glu Ala Val Ala Lys His Arg Asp Tyr Thr Lys Asn Tyr 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 

Met Glu Asp Ser Asn Ser Phe Met Leu Ser His Gln Met Glu Thr Ile

1 5 10 15

Leu Lys Glu Ala Gly Phe Thr Lys Ala Val Ser Tyr Phe Ile Leu Glu

20 25 30

Leu Lys Asp Pro Ser Gln Thr Lys Val Val Thr Glu Glu Leu Gln Lys

35 40 45

Asn Lys Lys Tyr Thr Val Leu Ser
50 55


(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 



Met Gly Lys Lys Arg Trp Ala Arg Asn Gly Phe Glu Ser Asn Asp Ala

1 5 10 15

Ser Tyr Ala Gln Val Val Ser Leu Tyr Asp Asp Thr Ser Ile Ser Val

20 25 30

Ser Asn Asn Glu Thr Asp Lys Val Leu Ala Gly Ser Leu Tyr Thr Asp

35 40 45

Thr Asn Glu Gln Gly Leu Thr Ile Pro Ser Ser Phe Thr Lys Lys Leu

50 55 60

Glu
65



(2) INFORMATION FOR SEQ ID NO: 296: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

Met Ala Met Glu Ser Gly Ala Thr Ala Val Val Ala Glu Arg Gly Gln

1 5 10 15

Glu Arg Ile Thr Lys Val Arg Glu Ile Leu Gly Gly Gly Ala Asp Ala

20 25 30

Ala Leu Glu Cys Val Gly Thr Glu Ala Ala Ile Glu Gln Ala Leu Gly

35 40 45

Val Leu His Asn Gly Gly Arg Met Gly Phe Val Gly Val Pro His Tyr

50 55 60

Asn Asn Arg Ala Leu Gly Ser Thr Phe Met Gln Asn Ile Ser Val Ala
65 70 75 80

Gly Gly Ala Ala Ser Ala Thr Thr Tyr Asp Lys Gln Phe Leu Leu Lys

85 90 95

Ala Val Leu Asp Gly Asp Ile Asn Pro Gly Arg Val Phe Thr Ser Ser

100 105 110

Tyr Lys Leu Glu Asp Ile Asp Gln Ala Tyr Lys Asp Met Asp Glu Arg

115 120 125

Lys Thr Ile Lys Ser Met Ile Val Ile Glu
130 135 

(2) INFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

Met Lys Gln Val Val Asp Tyr Phe Leu Ser Gln Gly Met Asp Arg Ile

1 5 10 15

Gly Ile Leu Thr Gly Leu Glu Glu Thr Thr Asp Gln Glu Glu Ile Ile

20 25 30

Gln Asp Lys Arg Leu Glu Asn Phe Lys Asn Tyr Ser Gln Ala Arg Gly

35 40 45

Ile Tyr His Asp Glu Leu Val Phe Gln Gly Arg Phe Thr Ala Gln Ser

50 55 60

Gly Tyr Asp Leu Met Lys Glu Ala Ile Gln Ser Leu Gly Asp Gln Leu
65 70 75 80

Pro Pro Ala Phe Phe Ala Ala Ser Asp Ser Leu Ala Ile Gly Ala Leu

85 90 95

Arg Ala Leu Gln Glu Ala Gly Ile Ser Leu Pro Asp Arg Val Ser Leu

100 105 110

Ile Phe Leu Leu Thr Thr Leu Ser Leu Thr Lys Gln Val Tyr Pro Pro
115 120 125

Leu Ser 
130 

(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298: 

Met Ala Lys Val Thr Ile Met Leu Ala Cys Ala Ala Gly Met Ser Thr

1 5 10 15

Ser Leu Leu Val Thr Lys Met Gln Lys Ala Ala Glu Asp Lys Gly Leu

20 25 30

Asp Ala Glu Ile Phe Ala Val Pro Ala Pro Glu Ala Glu Glu Ile Val

35 40 45

Ala Thr Lys Glu Val Asn Val Leu Leu Leu Gly Pro Gln Val Arg Tyr

50 55 60

Leu Leu Gly Asp Phe Gln Glu Lys Leu Lys Asp Arg Gln Ile Pro Val
65 70 75 80

Ala Val Ile Pro Met Thr Asp Tyr Gly Met Met Asn Gly Ser Lys Val

85 90 95

Leu Asp Leu Ala Glu Ser Leu Leu Asp

100 105



(2) INFORMATION FOR SEQ ID NO: 299: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:



Met His Gly Ala Gly Val Asn Glu Pro Val Ala Glu Leu Ser Val Ala 

1 5 10 15 

Glu Gln Leu Leu Glu Ala Gly Ala Asp Val Ile Leu Val Pro Ala Val

20 25 30

Gly Thr Val Pro Ala Phe His Asp Gln Glu Leu Arg Glu Val Val Asp

35 40 45

Leu Val His Ser Lys Gly Gly Leu Val Leu Ser Ala Ile Gly Thr Ser

50 55 60

Gln Glu Thr Ser Asp Thr Asp Thr Ile Lys Glu Ile Ala Leu Arg Asn
65 70 75 80

Lys Ile Cys Gly Val Asp Ile Gln His Ile Gly Asp Ala Gly Tyr Gly

85 90 95

Gly Leu Ala Thr Val Asp Asn Ile Tyr Ala Leu Ser Lys Ala Ile Arg

100 105 110 

Gly Val Arg His Thr Val Ser Arg Leu Ala Arg Ser Val Asn Arg 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 300:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 



Met Ser Ala Ile Asn Leu Ala Ser Val Pro Lys Glu Ser Leu Thr Gln

1 5 10 15

Val Leu Pro Arg Asp Leu His Ala Glu Tyr Phe Ala Val Leu Ala Ser

20 25 30

Ile Ala Thr Ser Ile Glu Arg Met Ala Thr Glu Ile Arg Gly Leu Gln

35 40 45

Lys Ser Glu Gln Arg Glu Val Glu Glu Phe Phe Ala Lys Gly Gln Lys

50 55 60

Gly Ser Ser Ala Met Pro His Lys Arg Asn Pro Ile Gly Ser Glu Asn
65 70 75 80

Met Thr Gly Leu Ala Arg Val Ile Arg Gly His Met Ile Thr Ala Tyr

85 90 95

Glu Asn Val Ala Leu Trp His Glu Arg Asp Ile Ser His Ser Ser Ala

100 105 110

Glu Arg Ile Ile Thr Pro Asp Thr Thr Ile Leu Ile Asp Tyr Met Leu

115 120 125

Asn Arg Phe Gly Asn Ile Val Lys Asn Leu Thr Val Phe Pro Glu Asn

130 135 140

Met Ile Arg Asn Met Asn Ser Thr Phe Gly Leu Ile Phe Ser Gln Arg
145 150 155 160

Ala Met Leu Thr Leu Ile Glu Lys Gly Met Thr Arg Glu Gln Ala Tyr

165 170 175

Asp Leu Val Gln Pro Lys Thr Ala Tyr Ser Trp Asp Asn Gln Val Asp

180 185 190

Phe Lys Pro Leu Leu Glu Ala Asp Ser Glu Val Thr Ser Arg Leu Thr

195 200 205

Gln Glu Glu Ile Asp Glu Ile Phe Asn Pro Val Tyr Tyr Thr Lys Arg

210 215 220

Val Asp Asp Ile Phe Glu Arg Leu Gly Leu Gly Asp
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:301: 

Met Leu Asn Leu Thr His Val Thr Leu Lys Thr Arg Gln Val Ile Leu

1 5 10 15

Gln Asp Ala Asp Phe Thr Phe Lys Lys Gly Arg Ile Tyr Gly Leu Leu

20 25 30

Ala Ile Asn Gly Ser Gly Lys Thr Thr Leu Phe Arg Ala Met Ser Lys

35 40 45

Leu Leu Pro Leu Ser Ser Gly His Ile Ala Val Pro Pro Ser Leu Phe

50 55 60

Tyr Tyr Glu Ser Val Glu Trp Leu Asp Gly Asn Leu Ser Gly Met Asp
65 70 75 80

Tyr Leu Arg Leu Ile Lys Lys His Leu Glu Val Arg Pro Lys Leu Glu
85 90 95 

Arg 



(2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 

Met Tyr Thr Leu Gly Gln Pro Asn Glu Leu Ala Ala Glu Phe Leu Asn

1 5 10 15

Phe Val Leu Ser Asp Glu Thr Gln Glu Gly Ile Val Lys Gly Leu Lys

20 25 30

Tyr Ile Pro Ile Lys Glu Met Lys Val Glu Lys Asp Ala Ala Gly Thr

35 40 45

Val Thr Val Leu Glu Gly Arg Gln
50 55 

(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:303: 

Met Asn Gln Glu Glu Leu Ala Lys Lys Met Leu Leu Pro Ser Lys Asn

1 5 10 15

Ser Arg Leu Glu Lys Leu Gly Lys Gly Leu Thr Phe Ala Cys Leu Ser

20 25 30

Leu Ile Val Ile Leu Val Ala Met Ile Leu Val Phe Val Ala Gln Lys

35 40 45

Gly Leu Ser Thr Phe Phe Val Asn Gly Val Asn Ile Phe Asp Phe Leu

50 55 60

Leu Gly Gly Thr Trp Asn Pro Ser Ser Lys Glu Phe Gly Ala Leu Pro
65 70 75 80

Met Ile Leu Gly Ser Phe Ile Val Thr Ile Leu Ser Ala Leu Ile Ala

85 90 95

Thr Pro Phe Ala Ile Gly Ala Ala Val Phe Met Thr Glu Val Ser Pro

100 105 110

Lys Gly Ala Lys Ile Leu Gln Pro Ala Ile Glu Leu Leu Val Gly Ile

115 120 125

Pro Ser Val Val Tyr Gly Phe Ile Gly Leu Gln Val Val Val Pro Phe

130 135 140

Val Arg Ser Val Phe Gly Gly Thr Gly Phe Gly Ile Leu Ser Gly Ile
145 150 155 160

Ser Val Leu Phe Val Met Ile Leu Pro Thr Val Thr Phe Met Thr Thr

165 170 175 

Asp Ser Leu Arg Ala Val Pro Xaa Leu Leu Ser 
180 185 

(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304:

Met Thr Glu Ile Arg Leu Glu His Val Ser Tyr Ala Tyr Gly Gln Glu

1 5 10 15

Arg Ile Leu Glu Asp Ile Asn Leu Gln Val Thr Ser Gly Glu Val Val

20 25 30

Ser Ile Leu Gly Pro Ser Gly Val Gly Lys Thr Thr Leu Phe Asn Leu

35 40 45

Ile Ala Gly Ile Leu Glu Val Gln Ser Gly Arg Ile Val Leu Asp Gly

50 55 60

Glu Glu Asn Pro Lys Gly His Val Ser Tyr Met Leu Gln Lys Asp Leu
65 70 75 80

Leu Leu Glu His Lys Thr Val Leu Gly Asn Ile Ile Leu Pro Leu Leu

85 90 95

Ile Gln Lys Val Asp Lys Ala Glu Ala Ile Ser Arg Ala Asp Lys Ile

100 105 110

Leu Ala Thr Phe Gln Leu Thr Ala Val Arg Asp Lys Tyr Pro His Glu

115 120 125

Leu Ser Gly Gly Met Arg Gln Arg Val Ala Leu Leu Arg Thr Tyr Leu

130 135 140

Phe Gly His Lys Leu Phe Leu Leu Asp Glu Ala Phe Ser Ala Leu Asp
145 150 155 160

Glu Met Thr Lys Met Glu Leu His Ala Trp Tyr Leu Glu Ile His Lys

165 170 175

Gln Leu Gln Leu Thr Thr Leu Ile Xaa Thr His Ser Ile Glu Glu Ala

180 185 190

Leu Xaa Leu Ser Asp Arg Ile Tyr Ile Leu Xaa Asn Xaa Pro Gly Gln

195 200 205

Ile Val Ser Glu Ile Lys Leu Asp Trp Ser Glu Asp Glu Asp Xaa Glu

210 215 220

Val Xaa Lys Ile Ala Xaa Lys Arg Gln Ile Leu Ala Glu Leu Gly Leu
225 230 235 240 

Asp Lys 



(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

Met Glu His Pro Xaa Glu Ala Ala Asp Ile Leu Ile Lys Asn Ala Pro

1 5 10 15

Glu Leu Lys Glu Lys Arg Asp Phe Val Ile Glu Ser Gln Lys Tyr Leu

20 25 30

Ser Lys Glu Tyr Ala Ser Asp Lys Glu Lys Trp Gly Gln Phe Asp Ala

35 40 45

Ala Arg Trp Asn Ala Phe Tyr Lys Trp Asp Lys Glu Asn Gly Ile Leu

50 55 60 

Lys Glu Asp Leu Thr Asp Lys Gly Phe Thr Asn Glu Phe Val Lys 
65 70 75 

(2) INFORMATION FOR SEQ ID NO: 306:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:306: 

Met Val Tyr Thr Ser Leu Ser Ser Lys Asp Gly Asn Tyr Pro Tyr Gln

1 5 10 15

Leu Asn Ile Ala His Leu Tyr Gly Asn Leu Met Asn Thr Tyr Gly Asp

20 25 30

Asn Gly Asn Ile Leu Met Leu Lys Tyr Val Ala Glu Lys Leu Gly Thr

35 40 45

His Val Thr Val Asp Ile Val Ser Leu His Asp Asp Phe Asp Glu Asn

50 55 60

His Tyr Asp Ile Ala Phe Phe Gly Gly Gly Gln Asp Phe Glu Gln Ser
65 70 75 80

Ile Ile Ala Asp Asp Leu Pro Ala Lys Lys Glu Ser Ile Asp Asn Tyr

85 90 95

Ile Gln Asn Asp Gly Val Val Leu Ala Ile Cys Gly Gly Phe Gln Leu

100 105 110

Leu Gly Gln Tyr Tyr Val Glu Ala Ser Gly Lys Arg Ile Glu Gly Leu
115 120 125

Gly Val Met Gly His Tyr Thr Leu Asn Gln Thr Asn Asn Arg Phe Ile
130 135 140

Gly Asp Ile Lys Ile His Asn Glu Arg Phe Arg
145 150 155



(2) INFORMATION FOR SEQ ID NO: 307: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:



Met His Gln His Ser Pro His Thr Ala Ser Tyr Thr Ser Ser Arg Asp
1 5 10 15

Ala Met Ser Lys Tyr Pro Glu Arg Cys Thr Thr Val Gly Leu Arg Leu
20 25 30

Asn Glu Glu Ser Asp Phe Glu Leu Tyr Ala Pro Tyr Gly Leu Glu Asp
35 40 45

Ile Leu Asn Phe Lys Phe Val Gln Leu Leu Ile Ser
50 55 60



(2) INFORMATION FOR SEQ ID NO: 308: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:308: 



Met Phe Glu Ile Phe Asn Met Gly Val Gly Leu Met Leu Ala Val Ser

1 5 10 15

Pro Glu Asn Val Glu Arg Val Lys Glu Leu Leu Asp Glu Ala Val Tyr

20 25 30

Glu Ile Gly Arg Ile Val Lys Lys Glu Asn Glu Ser Val Ile Ile Lys

35 40 45 






(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:

Met Met Val Lys Lys His Tyr Phe Lys Asp Asn Glu Glu Asp Ser Lys 

1 5 10 15 

Met Arg Arg Ile Leu Leu Leu Val Ala Val Leu Leu Met Ile Pro Ser

20 25 30

Phe Ile Ser Ala Thr Thr Leu Val Arg Glu Thr Leu Lys Lys Glu Ser

35 40 45

Leu Lys Lys Phe Ile Ser Glu Gln Phe Gln Gly His Asn Ile Leu Lys

50 55 60

Lys Thr Tyr Ser Lys Lys Thr His Thr Leu Lys Leu Thr Ile Ser Gly
65 70 75 80

Asn Tyr Leu Thr Glu Glu Glu Leu Asp Met Ile Ser Ser Lys Arg Gly

85 90 95

Asp Tyr Gly Leu Ser Asp Val Ser Val Gln Val Ser Gln Leu Ser Asp

100 105 110

Ser Glu Gln Leu Ser Lys Glu Glu Leu Val Glu Tyr Phe Phe Gln Tyr

115 120 125

Ile Lys Asp Lys Glu Ala Lys Glu Lys Glu Lys Ala Asn Lys Phe Tyr

130 135 140

Thr Glu Ser Glu Glu Gln
145 150 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:



Met Ser Ile Glu Pro Arg Arg Arg Trp Arg Asn Ser Thr Val Cys Leu

1 5 10 15

Glu Ala Glu Leu Tyr Gln Glu Ser Leu Val Leu Ile Gly Gly Asp Pro

20 25 30

Gly Ile Gly Lys Ser Thr Leu Leu Leu Gln Val Ser Thr Gln Leu Ser

35 40 45

Gln Val Gly Thr Val Leu Tyr Val Ser Gly Glu Glu Ser Ala Gln Gln

50 55 60

Ile Lys Leu Arg Ala Glu Arg Leu Gly Asp Phe Asp Ser Glu Phe Tyr
65 70 75 80

Leu Tyr Ala Glu Thr Asn Met Gln Ser Val Arg Ala Glu Val Val Ala

85 90 95 

Tyr Pro Ala Arg Leu Ser His Tyr 
100 

(2) INFORMATION FOR SEQ ID NO: 311: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 



Met Ala Thr Lys Gln Lys Glu Val Thr Thr Phe Asp Val Gln Val Ala

1 5 10 15

Glu Phe Ile Arg Asn His Lys Gln Lys Gly Thr Ala Thr Asp Asp Glu

20 25 30

Ile Asn Ala Ser Leu Val Ile Pro Phe Thr Leu Asp Ala Asp Gly Ile

35 40 45

Glu Asp Leu Leu Gln Arg Ile Gln Asp Ala Gly Ile Ser Ile Thr Asp

50 55 60

Asn Glu Gly Asn Pro Ser Ala Arg Val Leu Ser Asn Glu Glu Glu Pro
65 70 75 80

Glu Leu Ser Asp Glu Asp Leu Ile Gly Ser Thr Ser Ala Lys Val Asn

85 90 95

Asp Pro Val Arg Met Tyr Leu Lys Glu Ile Gly Val Val Pro Leu Leu
100 105 110

Thr Asn Glu Glu Glu Lys Glu Leu Ala Leu Ala Val Glu Ala Gly Asp
115 120 125

Ile Glu Ala Lys Gln Arg Leu Ala Glu Ala Asn Leu Arg Leu Val Val
130 135 140

Ser Ile Ala Lys Arg Tyr Val Gly Arg Gly Xaa Gln Ser Leu Thr
145 150 155



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 



Met Met Leu Lys Pro Ser Ile Asp Thr Leu Leu Asp Lys Val Pro Ser

1 5 10 15

Lys Tyr Ser Leu Val Ile Leu Glu Ala Lys Arg Ala His Glu Leu Glu

20 25 30

Ala Gly Ala Pro Ala Thr Gln Gly Phe Lys Ser Glu Lys Ser Thr Leu

35 40 45

Arg Ala Leu Glu Glu Ile Glu Ser Gly Asn Val Thr Ile His Pro Asp

50 55 60

Pro Glu Gly Lys Arg Glu Ala Val Arg Arg Arg Ile Glu Glu Glu Lys
65 70 75 80

Arg Arg Lys Glu Glu Glu Glu Lys Lys Ile Lys Glu Gln Ile Ala Lys

85 90 95

Glu Lys Glu Asp Gly Glu Lys Ile
100 



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313:



Met Ile Arg Gln Gly Gln Met Leu Glu Tyr Ala Glu Tyr Val Gly Asn

1 5 10 15

Tyr Tyr Gly Thr Pro Leu Thr Tyr Val Asn Glu Thr Leu Asp Lys Gly

20 25 30

Ile Asp Val Phe Leu Glu Ile Glu Val Gln Gly Ala Leu Gln Val Lys

35 40 45

Lys Lys Val Pro Asp Ala Val Phe Ile Phe Leu Thr Pro Pro Asp Leu

50 55 60

Asp Glu Leu Gln Glu Arg Leu Val Gly Arg Gly Thr Asp Ser Ala Glu
65 70 75 80

Val Ile Ala Gln Arg Ile Glu Lys Ala Lys Glu Glu Ile Ala Leu Met

85 90 95

Arg Glu Tyr Asp Tyr Ala Ile Val Asn Asp Gln Val Pro Leu Ala Ala

100 105 110

Glu Arg Val Lys Cys Val Ile Glu Ala Glu His Phe Cys Val Asp Arg

115 120 125

Val Ile Gly His Tyr Gln Glu Met Leu Pro Lys Ser Pro Thr Thr Arg
130 135 140 

(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

Met Ser Arg Leu Leu Val Ile Gly Cys Gly Gly Val Ala Gln Val Ala

1 5 10 15

Ile Ser Lys Ile Cys Gln Asp Ser Glu Thr Phe Thr Glu Ile Met Ile

20 25 30

Ala Ser Arg Thr Lys Ser Lys Cys Asp Asp Leu Lys Ala Lys Leu Glu

35 40 45

Gly Lys Thr Ser Thr Lys Ile Glu Thr Ala Ala Leu Asp Ala Asp Lys

50 55 60

Val Glu Glu Val Ile Ala Leu Ile Glu Ser Leu Gln Thr Ser Lys Leu
65 70 75 80

Phe

(2) INFORMATION FOR SEQ ID NO: 315:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

Met His Glu Ser Phe His Val Asp Asp Pro Thr Leu Tyr Ser Arg Glu 

1 5 10 15

Trp Phe Ser Trp Ala Asn Met Met Phe Cys Glu Leu Val Leu Asp Tyr

20 25 30

Leu Asp Ile Arg
35 

(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316:

Met Arg Phe Tyr Phe Met Glu Asn Val Val Val His Ile Ile Ser His

1 5 10 15 

Ser His Trp Asp Arg Glu Trp Tyr Leu Pro Phe Glu Ser His Arg Met 

20 25 30 

Gln Leu Val Glu Leu Phe Asp Asn Leu Phe Asp Leu Phe Glu Asn Asp

35 40 45

Pro Glu Phe Lys Ser Phe His Leu Asp Gly Gln Thr Ile Val Leu Asp
50 55 60

Asp
65

(2) INFORMATION FOR SEQ ID NO: 317:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317:



Met Leu Leu Tyr Ile Leu Ser His Ile Ala Thr Gly Ile Val Ser Gly
1 5 10 15

Thr Cys Leu Leu Lys Ala Ile Val Cys Asn Trp Trp Asn Cys Leu Thr
20 25 30

Ile Ser Leu Ile Ser Leu Lys Met Thr Leu Ser Ser Arg Val Ser Thr
35 40 45

Trp Met Asp Lys Leu Leu Ser Leu Thr Thr Asn Leu Gln Ile Arg Pro
50 55 60

Glu Asn Arg Asp Lys Val Gln Arg Tyr Ile Asp Glu Gly Lys Leu Lys
65 70 75 80

Ile Gly Pro Phe Tyr Ile Leu Gln Asp Asp Tyr Leu Ile Ser Ser Glu
85 90 95

Ala Asn Val Arg Asn Thr Leu Ile Gly Gln Gln Glu Ala Ala Lys Trp
100 105 110

Gly Lys Ser Thr Gln Ile Gly Tyr Phe Pro Asp Thr Phe Gly Asn Met
115 120 125

Gly Gln Ala Pro Gln Ile Leu Gln Lys Ser Gly Ile His Val Ala Ala
130 135 140

Phe Gly Arg Gly Val Lys Pro Ile Gly Phe Asp Asn Gln Val Leu Glu
145 150 155 160

Asp Glu Gln Phe Thr Ser Gln Phe Ser Glu Met Tyr Trp Gln Gly Val
165 170 175

Asp Gly Ser Arg Val Leu Gly Ile Leu Phe Ala Asn Trp Tyr Ser Asn
180 185 190

Gly Asn Glu Ile Pro Val Asp Lys Asp Glu Ala Leu Thr Phe Trp Lys
195 200 205

Gln Lys Leu Ser Asp Val Arg Cys Leu Arg Phe Asp Gln Pro Met Val
210 215 220

Asp Asp Glu Thr Ala Val Asn Thr Ser Leu Ser Gln Lys Lys Ser Glu
225 230 235 240

Arg Ser His Ser Val Trp Gln Met Asn Ser Ser Arg Met
245 250

(2) INFORMATION FOR SEQ ID NO: 318:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 



Met Thr Ile Val Gly Cys Arg Ile Asp Gly Arg Leu Ile His Gly Gln
1 5 10 15

Val Ala Asn Leu Trp Ala Gly Lys Leu Asn Val Ser Arg Ile Met Val
20 25 30

Val Asp Asp Glu Val Val Asn Asn Asp Ile Glu Lys Ser Gly Leu Lys
35 40 45

Leu Ala Thr Pro Pro Gly Val Lys Leu Ser Ile Leu Pro Val Glu Lys
50 55 60

Ala Ala Ala Asn Ile Leu Ala Gly Lys Tyr Asp Ser Gln Arg Leu Phe
65 70 75 80

Ile Val Ala Arg Lys Pro Asp Arg Phe Pro Trp Phe Gly Arg Ser Arg
85 90 95

Cys Thr Thr



(2) INFORMATION FOR SEQ ID NO: 319:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:



Met Ile Gln His Pro Arg Ile Gly Ile Arg Pro Thr Ile Asp Gly Arg

1 5 10 15

Arg Gln Gly Val Arg Glu Ser Leu Glu Val Gln Thr Met Asn Met Ala
20 25 30

Lys Ser Val Ala Asp Leu Ile Ser Ser Thr Leu Lys Tyr Pro Asp Gly

35 40 45

Glu Pro Val Glu Cys Val Ile Ser Pro Ser Thr Ile Gly Arg Val Pro

50 55 60

Glu Ala Ala Ala Ser His Glu Leu Phe Lys Lys Ser Asn Val Cys Ala
65 70 75 80

Thr Ile Thr Val Thr Pro Cys Trp Cys Tyr Gly Ser Glu Thr Met Asp

85 90 95

Met Ser Pro Asp Ile Pro His Ala Ile Trp Gly Phe Asn Gly Thr Glu

100 105 110

Arg Pro Gly Ala Val Tyr Leu Ala Ala Val Leu Ala Ser His Ala Gln

115 120 125

Lys Gly Ile Pro Ala Phe Gly Ile Tyr Gly Arg Asp Val Gln Glu Ala

130 135 140

Asn Asp Thr Asp Ile Pro Glu Asp Val Lys Glu Asn Phe
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

Met Leu Leu Ser Thr Lys Gly Ile Glu Glu Gln Glu Ile Ala Arg Pro

1 5 10 15

Thr Leu Glu Arg Leu Phe Ser Met Arg Glu Asn Tyr Lys Val Thr Gly 

20 25 30 

Arg His Pro Gly Tyr Arg Lys Tyr Asn Gly Asp Gly Ser Met Lys Glu 

35 40 45 

Thr Glu Lys 
50 

(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:321: 

Met Leu Thr Leu Thr Pro Asp Val Ile Tyr Gln Glu Pro Asp Ile Leu

1 5 10 15

Tyr Gln His Glu Asp Phe Ser Leu Val Lys Ile Arg Gln Ile Arg Phe
20 25 30 

Cys 



(2) INFORMATION FOR SEQ ID NO:322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

Met Ser Ser Val Val Gly Leu Met Gly Asn Ile Gly Gln Ala Asn Tyr

1 5 10 15

Ala Ala Ser Lys Ala Gly Leu Ile Gly Phe Thr Lys Ser Val Ala Arg

20 25 30

Glu Val Ala Ser Arg Asn Ile Arg Val Asn Val Ile Ala Pro Gly Asn
35 40 45 

Asp 



(2) INFORMATION FOR SEQ ID NO: 323:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:



Met Lys Leu Glu His Lys Asn Ile Phe Ile Thr Gly Ser Ser Arg Gly

1 5 10 15

Ile Gly Leu Ala Ile Ala His Lys Phe Ala Gln Ala Gly Ala Asn Ile

20 25 30

Val Leu Asn Ser Arg Gly Ala Ile Ser Glu Glu Leu Leu Ala Glu Phe

35 40 45

Ser Asn Tyr Gly Ile Lys Val Val Pro Ile Ser Gly Asp Val Ser Asp

50 55 60

Phe Ala Asp Ala Lys Arg Met Ile Asp Gln Ala Ile Ala Glu Leu Gly
65 70 75 80

Ser Val Asp Val Leu Val Asn Asn Ala Gly Ile Thr Gln Asp Thr Leu

85 90 95

Met Leu Lys Met Thr Glu Ala Asp Phe Glu Lys Val Leu Lys Val Asn

100 105 110

Leu Thr Gly Ala Phe Asn Met Thr Gln Ser Val Phe Gly Asn Arg
115 120 125

(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:



Met Glu Ser Met Pro Arg Ile Gly Leu Leu Val Thr Val Glu Asn Arg

1 5 10 15

Asp Thr Glu Thr Ile Phe Asn Ala Ala Gly Leu Asp Phe Asp Val Leu

20 25 30

Lys Ala Ser Ala Ile Ala Tyr Ile Asn Ala Asn Thr Phe Val Gln Lys

35 40 45 

Glu Asn Ala Gly Glu Met Gly Arg Ser Val Ser Tyr His Asp Met Arg 

50 55 60 

Ser Val 
65 

(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 



Met Arg Gln Gly Gln Gly Ser Val Glu Ala Ile Phe Asn Ala Ile Asp

1 5 10 15

Lys Phe Phe Asn Gln Ser Val Arg Leu Val Ser Tyr Thr Ile Asn Ala

20 25 30

Val Thr Asp Gly Ile Asp Ala Gln Asp Arg Val Val Gly His Cys
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

Met Val Glu Asn Pro Glu Gly Phe His Phe Asp Asp Leu Gln Leu Gln

1 5 10 15

Thr His Ala Asp Asn Asp Ile Glu Ala Leu Val Ser Leu Ala Asn Met

20 25 30 

Asp Gly Glu Lys Val Glu Phe Asn Ala Thr Gly Thr Gly Phe Arg 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:



Met Asn Phe Gln Leu Ala Lys Tyr Ser Leu Leu Lys Lys Phe Ser Glu

1 5 10 15

Asn Ile Gly Phe Thr Thr Pro Glu Glu Cys Gly Ala Ile Phe Gln Tyr

20 25 30

Leu Ile Glu Asn Val Gln Thr Asp Arg Gln Ile Ile Tyr Ser Pro Pro

35 40 45

Cys His Asp Glu Leu Arg Met Ala Val Ala Asn Ser Leu Ala Ala Val

50 55 60

Lys Asn Gly Ala Gly Leu Phe Glu Glu Thr Ile His Gly Ile Arg Glu
65 70 75 80

Arg Ala Glu Asn Ala Ala Leu Glu Glu Ile Ala Val Ala Leu Asn Ile

85 90 95

Arg Gln Asp Tyr Tyr Gln Val Glu Thr Ser Ile Val Leu Asn Glu Thr

100 105 110

Ile Asn Thr Ser Glu Met Val Ser Arg Phe Ser Gly Ile Pro Val Pro

115 120 125

Lys Asn Lys Ala Val Val Gly Gly Asn Thr Phe Ser His Glu Ser Gly

130 135 140

Ile His Gln Asp Gly Val Leu Lys Asn Pro Leu Thr Tyr Glu Ile Ile
145 150 155 160

Thr Pro Glu Leu Val Gly Val Lys Ile Pro Leu Gly Lys Leu Ser Gly

165 170 175

Arg His Ala Phe Val Glu Lys Leu Arg Glu Leu Ala Leu Asp Phe Thr

180 185 190

Glu Glu Asp Ile Lys Pro Phe Phe Ala Lys Phe Lys Ala Leu Ala Asp
195 200 205 

Lys Lys 
210 



(2) INFORMATION FOR SEQ ID NO: 328: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328:

Met Ser Gln Gln Val Lys Asn Ala His Asn Leu Tyr Ile His Ala Ile
1 5 10 15

Gln Asp Gly Arg Val Ala Glu Ala Gln Ala Gln Ser Val Gly Asp Thr
20 25 30

Tyr Ile Gln His Ser Thr Gly Val Pro Asp Gly Lys Glu Gly Phe Ala
35 40 45

Ala Phe Phe Ala Asp Phe Phe Glu Arg His Pro Glu Arg Gln Ile Lys
50 55 60

Ile Val Arg Thr Ile Glu Asp Gly Asn Leu Val Phe Val His Val Pro
65 70 75 80

Ser Ile Ser Glu Trp Trp Arg Ser Ser Met Gly Asp Asp Gly Tyr Phe
85 90 95

Pro Cys Gly



(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 122 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

Met Glu Met Gln Thr Ser Ala Gln Leu Leu Thr Asn Lys Ile Phe Leu

1 5 10 15

Lys Asn Pro Leu Lys Ala Leu Val Glu Glu Lys Tyr Gly Ile Glu Tyr

20 25 30

Glu Glu Phe Thr Asn Pro Trp His Ala Ala Ile Ser Ser Phe Val Ala

35 40 45

Phe Phe Leu Arg Ser Leu Pro Pro Met Leu Ser Val Thr Ile Phe Pro

50 55 60

Ser Glu Tyr Arg Ile Pro Ala Thr Val Leu Ile Val Gly Val Ala Leu

65 70 75 80

Leu Leu Thr Gly Tyr Thr Ser Ala Arg Leu Gly Lys Asp Pro Thr Arg

85 90 95

Thr Ala Met Ile Arg Asn Leu Ala Ile Gly Leu Leu Thr Met Gly Val

100 105 110

Thr Phe Leu Leu Glu Gln Leu Phe Ser Ile
115 120 






(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

Met Ala Phe Phe Leu Cys Pro His Phe Arg Ser Asn His Trp Lys Ile

1 5 10 15

Leu Thr Val Ser Asp Thr Met Glu Glu Lys Arg Leu Glu Tyr Pro Met

20 25 30

Val Thr Phe Leu Gly Asn Pro Val Ser Phe Thr Gly Lys Gln Leu Gln

35 40 45

Val Gly Asp Lys Ala Leu Asp Phe Ser Leu Thr Thr Thr Asp Leu Ser

50 55 60

Lys Lys Ser Leu Ala Asp Phe Asp Gly Lys Lys Lys Val Leu Ser Val
65 70 75 80

Val Pro Ser Ile Asp Thr Gly Ile Cys Ser Thr Gln Thr Arg Arg Phe

85 90 95

Asn Glu Glu Leu Ala Gly Leu Asp Asn Thr Val Val Leu Thr Val Ser

100 105 110

Met Asp Leu Pro Phe Ala Gln Lys Arg Trp Cys Gly Ala Glu Gly Leu

115 120 125

Asp Asn Ala Ile Met Leu Ser Asp Tyr Phe Asp His Ser Phe Gly Arg

130 135 140

Asp Tyr Ala Leu Leu Ile Asn Glu Trp His Leu Leu Ala Arg Ala Val
145 150 155 160

Phe Val Leu Asp Thr Asp Asn Thr Ile Arg Tyr Val Glu Tyr Val Asp

165 170 175

Asn Ile Asn Ser Glu Pro Asn Phe Glu Ala Ala Ile Ala Ala Ala Lys
180 185 190 

Ala Leu 



(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 48 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 



Met Gly Tyr Arg Pro Ser Thr Ala Asn Ala Ile Ile His Gln Val Arg
1 5 10 15

Glu Leu Leu Val Ser Arg Gly Tyr Thr Phe Tyr Asn Arg Lys Arg Leu
20 25 30

Met Val Val Pro Lys Ser Val Val Lys Glu Leu Leu Gly Met Glu Leu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



Met Gln Ala Val Glu His Phe Ile Lys Gln Phe Val Pro Glu His Tyr

1 5 10 15

Asp Leu Phe Leu Asp Leu Ser Arg Glu Thr Lys Thr Phe Ser Gly Lys

20 25 30

Val Thr Ile Thr Gly Gln Ala Gln Ser Asp Arg Ile Ser Leu His Gln

35 40 45

Lys Asp Leu Glu Ile Thr Ser Val Glu Val Ala Gly Gln Ala Arg Pro

50 55 60

Phe Thr Val Asp His Asp Asn Glu Ala Leu His Ile Glu Leu Ala Glu
65 70 75 80

Ala Gly Gln Val Glu Leu Val Leu Ala Phe Ser Gly Lys Ile Thr Asp

85 90 95

Asn Met Thr Gly Ile Tyr Pro Ser Tyr Tyr Thr Val Asp Gly Val Lys

100 105 110

Lys Glu Val Leu Ser Thr Gln Phe Glu Ser His Phe Ala Arg Glu Ala
115 120 125

Phe Pro Cys Val Asp Glu Pro Glu Ala Lys Ala Thr Phe Asp Leu Ser
130 135 140

Leu Arg Phe Asp Gln Ala Glu Gly Glu Leu Ala Leu Ser Asn Met Pro
145 150 155 160

Glu Ile Asp Val Glu Asn Arg Lys Glu Thr Gly Ile Trp Lys Phe Glu
165 170 175

Thr Thr Pro Arg Met Ser Ser Tyr Leu Leu Ala Phe Val Ala Gly Asp
180 185 190

Leu Gln Gly Val Thr Ala Lys Thr Lys Asn Gly Thr Leu Val Gly Cys
195 200 205

Leu Leu Asn Gln Ser Thr Ser Thr Phe Lys Ser
210 215



(2) INFORMATION FOR SEQ ID NO: 333: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:333: 



Met Ser Gln Glu Phe Leu Ala Arg Ile Leu Glu Gln Lys Ala Arg Glu
1 5 10 15

Val Glu Gln Met Lys Leu Glu Gln Ile Gln Pro Leu Arg Gln Thr Tyr
20 25 30

Arg Leu Ala Glu Phe Leu Lys Asn His Gln Asp Arg Leu Gln Val Ile
35 40 45

Ala Glu Val Lys Lys Ala Ser Pro Ser Phe Gly Glu Ile Ser Ile Ser
50 55 60

Met Trp Ile Leu Cys Asn Arg Pro Arg Leu Met Lys Lys Thr Glu Gln
65 70 75 80



(2) INFORMATION FOR SEQ ID NO: 334: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:334: 

Met Ser Ser Phe Thr Pro Glu Asp Leu Gly Met Glu Gly Tyr Ala Met 

1 5 10 15

Glu Asp Ile Arg Gly Gly Asn Ala Gln Glu Asn Ala Glu Ile Leu Leu

20 25 30

Ser Val Leu Lys Asn Glu Ala Ser Pro Phe Leu Glu Thr Thr Val Leu

35 40 45

Asn Ala Gly Leu Gly Phe Tyr Ala Asn Gly Lys Ile Asp Ser Ile Lys

50 55 60

Glu Gly Val Ala Leu Ala Arg Gln Val Ile Ala Arg Gly Lys Ala Leu
65 70 75 80

Glu Lys Leu Arg Leu Leu Gln Glu Tyr Gln Lys
85 90 

(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

Met Ile Tyr Thr Val Thr Leu Asn Pro Ser Ile Asp Tyr Ile Val Arg

1 5 10 15

Leu Asp Gln Val Lys Val Gly Ser Val Asn Arg Met Asp Ser Asp Asp

20 25 30

Lys Phe Ala Gly Gly Lys Gly Ile Asn Val Ser Arg Val Leu Lys Arg

35 40 45

Leu Asn Ile Ser Asn Thr Ala Thr Gly Phe Ile Gly Gly Phe Thr Gly

50 55 60

Lys Phe Xaa Xaa Asp Thr Leu Ala Glu Glu Glu Ile Glu Xaa Arg Phe
65 70 75 80

Val Gln Val Ala Glu Asp Thr Arg Ile Asn Val Lys Ile Xaa Ala Asp

85 90 95

Gln Glu Thr Glu Ile Asn Gly Thr Gly Pro Thr Val Glu Pro Val Lys
100 105 110

Leu Glu Glu Leu Lys Ala Ile Leu Ser Ser Leu Thr Ala Glu Asp Thr
115 120 125

Val Val Phe Ala Gly Ser Ser Ala Lys Asn Leu Gly Asn Val Ile Tyr
130 135 140

Lys Gly Phe Asn Leu Leu Asp Ala Pro Asp Trp Cys Ala Ser Gly Leu
145 150 155 160



(2) INFORMATION FOR SEQ ID NO: 336: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:



Met Met Gln Val Tyr Phe Asp Gln Gly Ile Tyr Asn Lys Lys Ala Val

1 5 10 15

Phe Glu Val Tyr Phe Arg Gln Gln Pro Phe Lys Asn Gly Tyr Ala Val

20 25 30

Phe Ala Gly Leu Glu Arg Ile Val Asn Tyr Leu Glu Asp Leu Arg Phe

35 40 45

Ser Asp Ser Asp Ile Ala Tyr Leu Glu Ser Leu Gly Tyr His Gly Ala

50 55 60

Phe Leu Asp Tyr Leu Arg Asn Phe Lys Leu Glu Leu Thr Val Arg Ser
65 70 75 80

Ala Gln Glu Gly Asp Leu Val Phe Ala Asn Glu Pro Ile Val Gln Val

85 90 95

Glu Gly Pro Leu Ala Gln Cys Gln Leu Val Glu Thr Ala Leu Leu Asn

100 105 110

Ile Val Asn Tyr Gln Thr Leu Val Ala Thr Lys Ala Ala Pro Tyr Pro

115 120 125 

Phe Gly Tyr Arg Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

Met Lys Ile Ile Ile Gln Arg Val Lys Lys Ala Gln Val Ser Ile Glu

1 5 10 15

Gly Gln Ile Gln Gly Lys Ile Asn Gln Gly Leu Leu Leu Leu Val Gly

20 25 30

Val Gly Pro Glu Asp Gln Glu Glu Asp Leu Asp Tyr Ala Val Arg Lys

35 40 45

Leu Val Asn Met Arg Ile Phe Ser Asp Val Glu Gly Lys Met Asn Leu

50 55 60

Ser Val Lys Asp Ile Glu Gly Glu Ile Leu Ser Ile Leu Ser Leu Pro
65 70 75 80 

Leu Cys Gly Tyr 



(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

Met Ile Lys Gly Thr Ile Lys Thr Val Ser Ile Met Ala Ala Thr Ser

1 5 10 15

Gly Phe Leu Leu Tyr Asn Glu Val Phe Phe Leu Thr Asn Gly Ala Ala

20 25 30

Gly Thr Lys Ser Ile Ser Phe Val Ile Arg Glu Leu Ala Val Ala Ser

35 40 45

Ser Arg Thr Gln Tyr Ala Arg Ala Asn Thr Ile Gly Val Ile Gln Ile

50 55 60

Leu Gly Gly Met Leu Ile Ile Val Cys Ile Asn Ile Leu Phe Arg Glu
65 70 75 80 

Arg Lys Arg Leu Lys Gly Gly Lys 
85

(2) INFORMATION FOR SEQ ID NO: 339:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

Met Gly Phe Gly Ile Pro Ala Ala Ile Gly Ala Lys Ile Ala Asn Pro

1 5 10 15

Asp Lys Glu Val Val Leu Phe Val Gly Asp Gly Gly Phe Gln Met Thr

20 25 30

Asn Gln Glu Leu Ala Ile Leu Asn Ile Tyr Lys Val Pro Ile Lys Val

35 40 45

Val Met Leu Asn Asn His Ser Leu Gly Met Val Arg Gln Trp Gln Glu

50 55 60

Ser Phe Tyr Glu Gly Arg Thr Ser Glu Ser Val Phe Asp Thr Leu Pro
65 70 75 80

Asp Phe Gln Leu Met Ala Gln Ala Tyr Gly Ile Lys Asn Tyr Lys Phe

85 90 95

Asp Asn Pro Glu Thr Leu Ala Gln Asp Leu Glu Ala Thr Thr Glu Asp

100 105 110

Val Pro Met Leu Ile Glu Val Asp Ile Ser Arg Lys Glu Gln Val Leu

115 120 125 

Pro Met Val Pro Ala Gly Lys Ser Asn His Glu Met Leu Gly Val Lys 

130 135 140 

Phe His Ala 
145 

(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340:

Met Trp Leu Pro Leu Asp Arg Asn Asn Lys Gly Gln Asn Ile Ala Gly

1 5 10 15

Ala Arg Gln Ala Ala Glu Gly Ile Phe Gly Val Asp Ala Ser Gln Leu

20 25 30

Thr Val Pro Gln Ala Ala Phe Leu Ala Gly Leu Pro Gln Ser Pro Ile

35 40 45

Thr Tyr Ser Pro Tyr Glu Asn Thr Gly Glu Leu Lys Asn
50 55 60

(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 107 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

Met Tyr Leu Gly Asp Leu Met Glu Lys Ala Glu Cys Gly Gln Phe Ser

1 5 10 15

Ile Leu Ser Phe Leu Leu Gln Glu Ser Gln Thr Thr Val Lys Ala Val

20 25 30

Met Glu Glu Thr Gly Phe Ser Lys Ala Thr Leu Thr Lys Tyr Val Thr

35 40 45

Leu Leu Asn Asp Lys Ala Leu Asp Ser Gly Leu Glu Leu Thr Ile His

50 55 60

Ser Glu Asp Glu Asn Leu Arg Leu Ser Ile Gly Ala Ala Thr Lys Gly

65 70 75 80

Arg Asp Ile Pro Glu Leu Val Phe Trp Ile Val Leu Leu Asn Thr Arg

85 90 95

Phe Trp Phe Ile Phe Ser Thr Thr Asn Ser Phe

100 105 

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Met Lys Glu Ile Ile Glu Lys Leu Ala Lys Phe Glu Asn Leu Ser Gly

1 5 10 15

Val Glu Met Thr Asp Val Ile Glu Arg Ile Val Thr Gly Arg Val Thr

20 25 30

Glu Ala Gln Ile Ala Ser Leu Leu Leu Ala Leu Lys Met Lys Gly Glu

35 40 45

Thr Pro Glu Glu Arg Thr Ala Ile Ala Gln Val Met Arg Gly His Ala

50 55 60

Gln His Ile Pro Thr Glu Ile His Asp Ala Met Asp Asn Cys Gly Thr
65 70 75 80

Gly Gly Asp Lys Ser Phe Ser Phe Asn Ile Ser Thr Thr Ala Ala Phe

85 90 95

Val Leu Ala Gly Gly Gly Ile His Met Ala Lys His Gly Asn Arg Ser

100 105 110

Ile Ser Ser Lys Ser Gly Ser Ala Asp Ser Xaa Asn Leu Gly Asn Gln
115 120 125 

Ser 



(2) INFORMATION FOR SEQ ID NO: 343: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 139 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:343: 

Met Asn Val Gln Met Ser Ser Lys Thr Asn Ile Leu Arg Ala His Ala

1 5 10 15

Glu Met Gln Asn Ile Gln Arg Arg Ala Asn Glu Glu Arg Gln Asn Leu

20 25 30

Gln Arg Tyr Arg Ser Gln Asp Leu Ala Lys Ala Ile Leu Pro Ser Leu

35 40 45 

Asp Asn Leu Glu Arg Ala Leu Ala Val Glu Gly Leu Thr Asp Asp Val 
50 55 60 

Lys Lys Gly Leu Gly Met Val Gln Glu Ser Leu Ile His Ala Leu Lys
65 70 75 80

Glu Glu Gly Ile Glu Glu Ile Ala Ala Asp Gly Glu Phe Asp His Asn

85 90 95

Tyr His Met Ala Ile Gln Thr Leu Pro Ala Asp Asp Asp His Pro Val

100 105 110

Asp Thr Ile Ala Gln Val Phe Gln Lys Gly Tyr Lys Leu His Asp Arg

115 120 125

Ile Leu Arg Pro Ala Met Val Val Val Tyr Asn
130 135



(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

Met Ser Asn Lys Ser Tyr Ser Lys Phe Gln Glu Glu Val Ser Leu Lys

1 5 10 15

Tyr Gly Phe Ile Gly Leu Lys Leu Asp Lys Leu Ser Leu Thr Ala Glu

20 25 30

Val Ser Glu Glu Phe His Ser Glu Ile Leu Ser Gly Asn Phe Thr Leu

35 40 45

Tyr Asp Ile Ser Val
50 



(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 

Met His Phe Asp Lys Ser Lys Phe Gly Ala Val Phe Ser Ala Pro Gly

1 5 10 15

Leu Tyr Glu Val Glu Val Ile Asn Asn Ala Ser Phe Gly Gln Asn Ala

20 25 30

Gln Tyr Glu Val Ile Gln Ser Arg Lys Leu Gly Thr Phe Ala Glu Leu

35 40 45

Ile Glu Met Ala Lys Ile Lys
50 55 

(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

Met Ile Gln Ala Val Phe Glu Arg Ala Glu Asp Gly Glu Leu Arg Ser

1 5 10 15

Ala Glu Ile Thr Gly His Ala Glu Ser Gly Glu Tyr Gly Leu Asp Val

20 25 30

Val Cys Ala Ser Val Ser Thr Leu Ala Ile Asn Phe Ile Asn Ser Ile

35 40 45

Glu Lys Phe Ala Gly Tyr Glu Pro Ile Leu Glu Leu Asn Glu Asp Glu

50 55 60

Gly Gly Tyr Leu Met Val Glu Ile Pro Lys Asp Leu Pro Ser His Gln
65 70 75 80

Arg Glu Met Thr Gln Leu Phe Phe Glu Ser Phe Phe Leu Gly Met Ala

85 90 95

Asn Leu Ser Glu Asn Ser Ser Glu Phe Val Gln Thr Arg Val Ile Thr
100 105 110 

Glu Asn 



(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 



Met Thr Ala Ile Ser Met Lys Trp Leu Asn Asp Tyr Ile Trp Pro Ala

1 5 10 15

Glu Ser Glu Phe Thr Pro Asp Met Thr Thr Asn Ala Val Lys Glu Ala

20 25 30

Leu Thr Glu Met Leu Gln Ser Gly Thr Thr Thr Phe Asn Asp Met Tyr

35 40 45

Asn Pro Asn Gly Val Asp Ile Gln Gln Ile Tyr Gln Val Val Lys Thr

50 55 60

Ser Lys Met Arg Cys Tyr Phe Ser Pro Thr Leu Phe Ser Ser Glu Thr
65 70 75 80

Glu Thr Thr Ala Glu Thr Ile Ser Arg Thr Arg Ser Ile Ile Asp Glu

85 90 95

Ile Leu Lys Tyr Lys Asn Pro Lys Phe Gln Gly Leu Trp
100 105



(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

Met Val Arg Glu Ser Ala Glu Ser Ala Gly Phe Phe Leu Glu Thr His 

1 5 10 15

Met Val Gln Gly Glu Trp Asn Thr Cys Val Phe Lys Lys Thr Lys Asp

20 25 30

Ile Ser Gly Val Ile Gly Gly
35 



(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 49 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:349: 

Met Thr Val Asp Arg Glu Gly Phe Glu Ala Ala Met Lys Glu Gln Gln

1 5 10 15

Glu Arg Ala Arg Ala Ser Ala Val Lys Gly Gly Ser Met Gly Met Gln

20 25 30

Asn Gly Asn Ser Ser Lys His His Cys Arg Lys Cys Leu Gln Leu Gln
35 40 45 

Cys 



(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

Met Lys Ala Gln Leu Val Pro Ser Arg Ser Lys Tyr Leu Ile Asn Asp

1 5 10 15

Asn Thr Val Val Leu Phe Phe Gly Thr Arg Thr Asp Tyr Thr Arg Lys

20 25 30

Tyr Asp Leu Asp Leu Val Arg Glu Val Ala Gly Asp Gln Ile Ala Arg

35 40 45

Arg Val Val Leu Leu Ser Asp Gln Ala Phe Gly Leu Glu Asn Val Lys

50 55 60

Glu Val Ala Leu Gly Cys Gly Gly Val Leu Asn Asp Ile Tyr Arg Val
65 70 75 80

Phe Pro Tyr Ile Val Tyr Ala Gln Leu Phe Ala Leu Leu Thr Ser Leu

85 90 95

Lys Val Glu Asn Lys Pro Asp Thr Pro Ser Pro Thr Gly Thr Val Asn
100 105 110

Arg Val Val Gln Gly Val Ile Ile His Glu Tyr Gln Lys
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:351: 



Met Ala Val Asn Asn Glu Ile Gly Ser Ile Gln Xaa Ile Glu Ala Ile

1 5 10 15

Ser Glu Phe Leu Ala Asp Lys Pro Thr Ile Ser Phe His Val Asp Ala

20 25 30

Val Gln Ala Leu Ala Lys Ile Pro Thr Glu Lys Tyr Leu Thr Glu Arg

35 40 45

Val Asp Cys Ala Thr Phe Ser Ser His Lys Phe His Gly Val Arg Gly

50 55 60

Val Gly Phe Val Tyr Ile Lys Ser Gly Lys Lys Ile Thr Pro Leu Leu
65 70 75 80

Thr Gly Gly Gly Gln Glu Arg Asp Tyr Arg Ser Thr Thr Glu Asn Val

85 90 95

Ala Gly Ile Ala Ala Thr Ala Lys Ala Leu Arg Leu Ser Met Glu Lys

100 105 110

Leu Asp Ile Phe Arg Ser Lys Thr Gly Gln Met Lys Ala Val Ile His

115 120 125

Gln Ala Leu Leu Asn Tyr Pro Asp Ile Phe Val Phe Ser Asp Glu Glu

130 135 140

Asp Phe Ala Pro His Ile Leu Thr Phe Gly Ile Lys Gly Val Arg Gly
145 150 155 160

Glu Val Ile Val His Ala Phe Glu Asp Tyr Asp Ile Phe Ile Ser Thr

165 170 175

Thr Ser Ala Cys Ser Ser Lys Ala Gly Lys Pro Ala Gly Thr Leu Ile

180 185 190

Ala Met Gly Val Asp Lys Asp Lys Ala Lys Ser Ala Val Arg Leu Ser

195 200 205

Leu Asp Leu Glu Asn Asp Met Ser Gln Val Glu Gln Phe Leu Thr Lys
210 215 220

Leu Lys Leu Ile Tyr Asn Gln Thr Arg Lys Val Arg
225 230 235

(2) INFORMATION FOR SEQ ID NO: 352:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352:

Met Leu Val Thr Gly Ala Ile Leu Gly Val Asn Val His Ile Phe Ser

1 5 10 15

Pro Lys Glu Leu Phe Pro Glu Lys Glu Ile Val Glu Leu Ala Glu Gly

20 25 30

Phe Ala Lys Glu Ser Gly Ala His Val Leu Ile Thr Glu Asp Ala Asp

35 40 45

Glu Ala Val Lys Asp Ala Asp Val Leu Tyr Thr Asp Val Trp Val Ser

50 55 60

Met Gly Glu Glu Asp Lys Phe Ala Glu Arg Val Ala Leu Leu Lys Pro
65 70 75 80

Tyr Gln Val Asn Met Asp Leu Val Lys Lys Ala Gly Asn Glu Asn Leu

85 90 95

Ile Phe Leu His Cys Leu Pro Ala Phe His Asp Thr His Thr Val Tyr

100 105 110

Gly Lys Asp Val Ala Glu Lys Phe Gly Val Glu Glu Met Glu Val Thr

115 120 125

Asp Glu Val Phe Arg Ser Lys Tyr Ala Arg His Phe Asp Gln Ala Glu

130 135 140

Asn Arg Met His Thr Ile Lys Ala Val Met Ala Ala Thr Leu Gly Asn
145 150 155 160

Leu Tyr Ile Pro Lys Val
165 

(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353:

Met Arg Leu Glu Gln Asp Cys Pro Val Phe Leu Lys Ile Lys Glu Lys

1 5 10 15

Asp Met Ala Ser Lys Met Leu His Thr Cys Leu Arg Val Glu Asn Leu

20 25 30

Glu Lys Ser Ile Ala Phe Tyr Gln Asp Ala Phe Gly Phe Lys Glu Leu

35 40 45

Arg Arg Arg Asp Phe Pro Asp His Ala Phe Thr Ile Val Tyr Leu Gly

50 55 60

Leu Glu Gly Asp Asp Tyr Glu Leu Glu Leu Thr Tyr Asn Tyr Asp His
65 70 75 80

Gly Pro Tyr Val Val Gly Asp Gly Phe Ala His Ile Ala Leu Ser Thr

85 90 95

Pro Asp Leu Glu Ala Leu His Gln Glu His Ser Thr Lys Gly Tyr Glu

100 105 110

Val Thr Glu Pro Asn Gly Leu Pro Gly Thr Ala Pro Asn Tyr Tyr Phe

115 120 125

Val Lys Asp Pro Asp Gly Tyr Lys Val Glu Val Ile Arg Glu Lys
130 135 140

(2) INFORMATION FOR SEQ ID NO: 354:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 

Met Asp His Thr Ile Phe Cys Val Gly Tyr Arg Val Met Gln Lys Asp

1 5 10 15

Leu Glu Gly Thr Leu Asp Ala Glu Lys Leu Lys Ala Ala Gly Val Pro

20 25 30

Phe Gly Pro Leu Phe Gly Lys Ile Lys Asn Gly Gln Asp Leu Val Leu
35 40 45

Glu Asp Gly Thr Glu Ile Lys Ala Ala Asp Tyr Ile Ser Ala Pro Arg

50 55 60

Pro Gly Lys Ile Ile Thr Ile Leu Gly Asp Thr Arg Lys Thr Asp Ala
65 70 75 80

Ser Val Arg Leu Ala Val Asn Ala Asp Val Leu Val His Glu Ser Thr

85 90 95

Tyr Gly Lys Gly Asp Glu Lys Ile Ala Arg Asn His Gly His Ser Thr

100 105 110

Asn Met Gln Ala Ala Gln Val Ala Val Glu Ala Gly Ala Lys Arg Leu

115 120 125

Leu Leu Asn His Ile Ser Ala Arg Phe Leu Ser Lys Asp Ile Ser Lys

130 135 140

Leu Lys Lys Asp Ala Ala Thr Ile Phe Glu Asn Val His Val Val Lys
145 150 155 160

Asp Leu Glu Lys Met Glu Ile Tyr Gln Ser Gln Lys Gly
165 170 

(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:

Met Ile Gln Pro Ala Ser Leu Glu Glu Leu Ala Ser Leu Val Glu Lys

1 5 10 15

Ala Gly Lys Lys Val Phe Leu Phe Val Ala Asp Trp Cys Gly Asp Cys

20 25 30

Arg Tyr Ile Tyr Pro Ala Leu Pro Glu Ile Glu Glu Thr Asn Pro Glu

35 40 45

Phe Thr Phe Ile Arg Met Asp Arg Asp Gln Tyr Met Asp Leu Ala Lys

50 55 60

Leu Trp Asp Val Tyr Gly Ile Pro Ser Leu Val Val Leu Glu Lys Asp
65 70 75 80

Lys Glu Ile Gly Arg Phe Val Asn Arg Asp Arg Lys Ser Lys Glu Gln

85 90 95

Ile Asn Asp Phe Leu Ala Gly Leu Lys
100 105

(2) INFORMATION FOR SEQ ID NO: 356:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356:

Met Arg Leu Gly Leu Leu Gln Val Leu Arg Leu Pro Lys Ala Phe Gln

1 5 10 15

Leu Ile Phe Ile Gln Asp Lys Gly His Gly Asp Val Ser Ser Phe Thr

20 25 30

Ala Ser Cys Val Thr Gly Pro Trp Ala Phe Lys Leu Val Gly Lys Gln

35 40 45

Gly Asn Ile His
50 

(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

Met Lys Thr Arg Lys Ile Pro Leu Arg Lys Ser Val Val Ser Asn Glu

1 5 10 15

Val Ile Asp Lys Arg Asp Leu Leu Arg Ile Val Lys Asn Lys Glu Gly

20 25 30

Gln Val Phe Ile Asp Pro Thr Gly Lys Ala Asn Gly Arg Gly Ala Tyr

35 40 45

Ile Lys Leu Asp Asn Ala Glu Ala Leu Glu Ala Lys Lys Lys Lys Val

50 55 60

Phe Asn Arg Ser Phe Ser Met Glu Val Glu Glu Ser Phe Tyr Asp Glu
65 70 75 80

Leu Ile Ala Tyr Val Asp His Lys Val Lys Arg Arg Glu Leu Gly Leu
85 90 95 

Glu 



(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358:

Met Arg Val Ala Tyr Phe Lys Val His His Pro Ile Tyr Tyr Tyr Cys

1 5 10 15

Ala Tyr Phe Ser Ile Arg Ala Lys Ala Phe Asp Ile Lys Thr Met Gly

20 25 30

Ala Gly Leu Glu Val Ile Lys Arg Arg Met Glu Glu Ile Ser Glu Lys

35 40 45

Arg Lys Asn Asn Glu Ala Ser Asn Val Glu Ile Asp Leu Tyr Thr Thr

50 55 60

Leu Glu Ile Val Asn Glu Met Trp Glu Arg Gly Phe Lys Phe Gly Lys
65 70 75 80

Leu Asp Leu Tyr Cys Ser Gln Thr Thr Glu Phe Leu Ile Asp Gly Asp

85 90 95

Thr Leu Ile Pro Pro Phe Val Ala Met Asp Gly Leu Gly Glu Asn Val

100 105 110

Ala Lys Gln Leu Val Arg Ala Arg Glu Glu Gly Glu Phe Leu Ser Lys

115 120 125

Thr Glu Leu Arg Lys Arg Gly Gly Leu Ser Ser Thr Leu Val Glu Lys

130 135 140

Met Asp Glu Met Gly Ile Leu Gly Asn Met Pro Glu Ile Thr Ser
145 150 155 

(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 



Met Ser Ser Lys Ile Ser Ile Gly Gln Leu Ile Thr Phe Asn Thr Leu

1 5 10 15

Leu Ser Tyr Phe Thr Thr Pro Met Glu Asn Ile Ile Asn Leu Gln Thr

20 25 30

Lys Leu Gln Ser Ala Lys Val Ala Asn Asn Arg Leu Asn Glu Val Tyr

35 40 45

Leu Val Glu Ser Glu Phe Gln Val Gln Glu Asn Pro Val His Ser His

50 55 60

Phe Leu Met Gly Asp Ile Glu Phe Asp Asp Leu Ser Tyr Lys Tyr Gly
65 70 75 80

Phe Gly Arg Asp Thr Leu Thr Asp Ile Asn Leu Thr Ile Lys Gln Gly

85 90 95

Asp Lys Val Ser Leu Val Gly Val Ser Gly Ser Gly Lys Thr Thr Leu

100 105 110

Ala Lys Met Ile Val Asn Phe Phe Glu Pro Tyr Lys Gly His Ile Ser

115 120 125

Ile Asn His Gln Asp Ile Lys Asn Ile Asp Lys Lys Ser Leu Ala Pro

130 135 140

Ser Tyr
145



(2) INFORMATION FOR SEQ ID NO: 360:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



Met Arg Gln Ala Pro Ala Leu Ala Gln Ala Asn Ile Glu Arg Val Val

1 5 10 15

Val His Lys Ile Ser Lys Val Trp Glu Phe His Phe Arg Ile Phe
20 25 30




(2) INFORMATION FOR SEQ ID NO:361: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

Met Gln Thr Gly Trp Gln Tyr Leu Gly Asn Lys Trp Tyr Tyr Leu Arg

1 5 10 15

Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gln Glu Gly Thr Thr Trp

20 25 30

Tyr Tyr Leu Asp Gln Pro Asn Gly Asp Met Lys Thr Gly Trp Gln Asn

35 40 45

Leu Gly Asn Lys Trp Tyr Tyr Leu Arg Ser Ser Gly Ala Met Ala Thr

50 55 60

Gly Trp Tyr Gln Glu Gly Thr Thr Trp Tyr Tyr Leu Asp Gln Pro Asn
65 70 75 80

Gly Asp Met Lys Thr Gly Trp Gln Asn Leu Gly Asn Lys Trp Tyr Tyr

85 90 95

Leu Arg Ser Ser Gly Ala Met Ala Thr Gly Trp Tyr Gln Asp Gly Ser

100 105 110

Thr Trp Tyr Tyr Leu Asn Ala Val Met Glu Ile
115 120 

(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362:

Met Lys Trp Gly Ser Arg Ile Leu Leu Gly Leu Thr Pro Lys Ser Leu
1 5 10 15

Arg Tyr Arg Ile Trp Lys Lys Ala Glu Lys Glu Met Thr Lys Tyr Asp

20 25 30

Leu Ala Asp Cys Asp Gly Ile Thr Glu Leu Cys Ser Gly Pro Gly Tyr

35 40 45

Met Arg Asn Lys Tyr Pro Ile Thr Ser Phe Glu Asp Asn Leu Phe Leu

50 55 60

Pro Phe Glu Gly Thr Glu Met Pro Ile Pro Ile Gly Tyr Asp Val Tyr
65 70 75 80

Leu Arg Thr Ala Phe Gly Asp Tyr Met Thr Pro Pro Pro Ala Asp Lys

85 90 95

Gln Val Pro His Gln Asp Ala Val Ile Ala Asp Met Asp Lys Ser Tyr

100 105 110

Thr Glu Tyr Lys Gly Glu Tyr Gly Gly 
115 120 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363:

Met Phe Pro Asp Ser Ile Asp Thr Met His Gln Ala Asn Glu Phe Ile

1 5 10 15

Ala Leu Asp Asp Leu Phe Arg Ala Ala Ala Ile Tyr Ala Glu Ala Ile

20 25 30

Tyr Glu Leu Ile Lys
35 

(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

Met Asp Glu Leu Gln Lys Arg Asn Leu Leu Gly Phe Val Phe Gln Asp

1 5 10 15

Phe Gln Leu Phe Pro His Leu Ser Val Leu Glu Asn Leu Thr Leu Ser

20 25 30

Pro Val Lys Thr Met Gly Met Lys Gln Glu Glu Ala Glu Lys Lys Ala

35 40 45

Ser Gly Leu Leu Glu Gln Leu Gly Leu Gly Gly His Ala Glu Ser Tyr

50 55 60

Pro Phe Ser Leu Ser Gly Gly Gln Lys Gln Arg Val Ala Leu Ala Arg
65 70 75 80

Ala Met Met Ile Asp Pro Glu Ile Ile Gly Tyr Asp Glu Pro Thr Ser

85 90 95

Ala Leu Asp Pro Glu Leu Arg Leu Glu Val Glu Lys Leu Ile Leu Gln

100 105 110

Asn Arg Glu Leu Gly Met Thr His Ile Val Val Thr His Asp Leu Gln
115 120 125

Phe Gly 
130 

(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365:

Met Tyr Gly Ala Asn Ser Glu Arg Leu Lys Glu Ser Leu Arg Ile Ser

1 5 10 15

Leu Ser Pro Gln Asn Thr Val Glu Asp Leu Gln Thr Leu Ala Lys Thr

20 25 30

Leu Lys Glu Ile Ile Gly Gly
35 

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 



Met Ile Ala Arg Ser Tyr Gln Glu Met Asp Asp Leu Asp Thr Ala Tyr

1 5 10 15

Glu His Tyr Gln Glu Leu Thr Gly Asp Leu Lys Asp Asn Pro Glu Phe

20 25 30

Leu Glu His Tyr Ile Tyr Leu Leu Arg
35 40 



(2) INFORMATION FOR SEQ ID NO: 367: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367:

Met Lys His Phe Asp Thr Ile Val Ile Gly Gly Gly Pro Ala Gly Met

1 5 10 15

Met Ala Thr Ile Ser Ser Ser Phe Tyr Gly Gln Lys Thr Leu Leu Ile

20 25 30

Glu Lys Asn Arg Lys Leu Gly Lys Lys Leu Ala Gly Thr Gly Gly Gly

35 40 45

Arg Cys Asn Val Thr Asn Asn Gly Ser Leu Asp Asn Leu Leu Ala Gly

50 55 60

Ile Pro Gly Asn Gly Arg Phe Leu Tyr Ser Val Phe Ser Gln Phe Asp
65 70 75 80

Asn His Asp Ile Ile Asn Phe Phe Thr Glu Asn Gly Val Lys Leu Lys

85 90 95

Val Glu Asp His Gly Arg Val Phe Pro Ala Ser Asp Lys Ser Arg Thr

100 105 110

Ile Ile Glu Ala Leu Glu Lys Lys Ile Thr Glu Leu Gly Gly Gln Val
115 120 125

Ala Thr Pro Asn Arg Asn Arg Phe Cys
130 135 



(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

Met Thr Leu Ala Lys Asp Ile Ala Ser His Leu Leu Lys Ile Gln Ala

1 5 10 15

Val Tyr Leu Lys Pro Glu Glu Pro Phe Thr Trp Ala Ser Gly Ile Lys

20 25 30

Ser Pro Ile Tyr Thr Asp Asn Arg Val Thr Leu Ala Tyr Pro Glu Thr

35 40 45

Arg Thr Leu Ile Glu Asn Gly Phe Val Glu Ala Ile Lys Glu Ala Phe

50 55 60

Pro Glu Val Glu Val Ile Ala Gly Thr Ala Thr Ala Gly Ile Pro His
65 70 75 80

Gly Ala Ile Ile Ala Asp Lys Met Asp Leu Pro Phe Ala Tyr Ile Pro

85 90 95

(2) INFORMATION FOR SEQ ID NO: 369:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369:

Met Val Lys Val Leu Ala Ala Cys Gly Asn Gly Met Gly Ser Ser Met

1 5 10 15

Val Ile Lys Met Lys Val Glu Asn Ala Leu Arg Lys Leu Asn Gln Thr

20 25 30

Asp Phe Thr Val Asn Ser Cys Ser Val Gly Glu Ala Lys Gly Leu Ala

35 40 45

Val Gly Tyr Asp Ile Val Ile Ala Ser Leu His Leu Ile Gln Glu Leu

50 55 60

Glu Gly Arg Thr Asn Gly Lys Leu Ile Gly Leu Asp Asn Leu Met Asp
65 70 75 80

Asp Lys Glu Ile Thr Glu Lys Leu Ser Gln Ala Ile Gln
85 90



(2) INFORMATION FOR SEQ ID NO: 370: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 



Met Ser Met Phe Leu Asp Thr Ala Lys Ile Lys Val Lys Ala Gly Asn

1 5 10 15

Gly Gly Asp Gly Met Val Ala Phe Arg Arg Glu Lys Tyr Val Pro Asn

20 25 30

Gly Gly Pro Trp Gly Gly Asp Gly Gly Arg Gly Gly Asn Val Val Phe

35 40 45

Val Val Asp Glu Gly Leu Arg Thr Leu Met Asp Phe Arg Tyr Asn Arg

50 55 60

His Phe Lys Ala Asp Ser Gly Glu Lys Gly Met Thr Lys Gly Met His
65 70 75 80

Gly Arg Gly Ala Glu Asp Leu Arg Val Arg Val Pro Gln Gly Thr Thr

85 90 95

Val Arg Asp Ala Glu Thr Gly Lys Val Leu Thr Asp Leu Ile Glu His

100 105 110

Gly Gln Glu Phe Ile Val Ala His Gly Gly Pro Trp Trp Thr Trp Lys

115 120 125

Tyr Ser Phe Arg Asp Thr Lys Lys Ser Cys Thr Gly Asn Leu
130 135 140



(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 210 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

Met Met Lys Gly Glu Met Thr Phe Lys Gln Val His Tyr Lys Tyr Gly

1 5 10 15

Tyr Gly Arg Asp Val Leu Ser Asp Ile Asn Leu Thr Val Pro Gln Gly

20 25 30

Ser Lys Val Ala Phe Val Gly Ile Ser Gly Ser Gly Lys Thr Thr Leu

35 40 45

Ala Lys Met Met Val Asn Phe Tyr Asp Pro Ser Gln Gly Glu Ile Ser

50 55 60

Leu Gly Gly Val Asn Leu Asn Gln Ile Asp Lys Lys Ala Leu Arg Gln
65 70 75 80

Tyr Ile Ile Tyr Leu Pro Pro Gln Pro Tyr Val Phe Asn Gly Thr Ile

85 90 95

Leu Glu Asn Leu Leu Leu Gly Ala Lys Gly Gly Asp Asp Thr Gly Lys

100 105 110

Ile Ser Leu Thr Gly Arg Ser Glu Phe Gly Ser Glu Ile Pro Lys Arg

115 120 125

Asn Ile Ser Lys Pro Pro Cys His Leu Glu Ile Tyr Gln Thr Glu Leu

130 135 140

Thr Ser Asp Gly Ala Gly Ile Ser Gly Gly Gln Arg Gln Arg Ile Ala
145 150 155 160

Leu Ala Arg Ala Leu Leu Thr Asp Ala Pro Val Leu Ile Leu Asp Glu

165 170 175

Ala Thr Asn Ser Leu Asp Ile Leu Thr Lys Lys Arg Ile Val His Tyr

180 185 190 

Leu Met Ala Leu Asp Lys Thr Phe Asp Phe His Cys Ser Pro Leu Asp 
195 200 205 

Tyr Cys 
210 

(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372:

Met Ala Arg Phe Ile Glu Ile Asp Ala Lys Glu His Asp Arg Val Thr

1 5 10 15

Ser Gln Ile Ser His Phe Pro His Ile Leu Ala Ser Ser Leu Met Glu

20 25 30

Gln Thr Ala Val Tyr Ala Gln Glu His Glu Asn Gly Lys Ala Leu Cys

35 40 45

Gly Arg Trp Phe Ser Arg Tyr Asp Pro Asn Cys Gly Lys Arg Ala Arg

50 55 60

Asn Val Asp Leu His Ser Leu Val Gln
65 70

(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 149 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373:

Met Ala His Gln Gly Gln Asp Ile Tyr Asp Phe Pro Arg Ala Met Ile

1 5 10 15

Lys Glu Asp Asn Leu Glu Phe Ser Phe Ser Gly Leu Lys Ser Ala Phe

20 25 30

Ile Asn Leu His His Asn Ala Glu Gln Lys Gly Glu Ser Leu Ser Thr

35 40 45

Glu Asp Leu Cys Ala Ser Phe Gln Ala Ala Val Leu Asp Ile Leu Met

50 55 60

Ala Lys Thr Lys Lys Ala Leu Glu Lys Tyr Pro Val Lys Thr Leu Val
65 70 75 80

Val Ala Gly Gly Val Ala Ala Asn Lys Gly Leu Arg Glu Arg Leu Ala

85 90 95

Ala Glu Val Thr Asp Val Lys Val Ile Ile Pro Pro Leu Arg Leu Cys

100 105 110

Gly Asp Asn Ala Gly Met Ile Ala Tyr Ala Ser Val Ser Glu Trp Asn

115 120 125

Lys Glu Asn Phe Ala Asn Leu Asp Leu Asn Ala Lys Pro Ser Leu Ala

130 135 140

Phe Asp Thr Met Glu
145

(2) INFORMATION FOR SEQ ID NO: 374: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374:

Met Lys Arg Ser Leu Asp Ser Arg Val Asp Tyr Ser Leu Leu Leu Pro

1 5 10 15

Val Phe Phe Leu Leu Val Ile Gly Val Val Ala Ile Tyr Ile Ala Val

20 25 30

Ser His Asp Tyr Pro Asn Asn Ile Leu Pro Ile Leu Gly Gln Gln Val

35 40 45

Ala Trp Ile Ala Leu Gly Leu Val Ile Gly Phe Val Val Met Leu Phe

50 55 60

Asn Thr Glu Phe Leu Trp Lys Val Thr Pro Phe Leu Tyr Ile Phe Arg
65 70 75 80

Leu Gly Thr Tyr Asp Leu Ala Asp Cys Ile Leu
85 90 

(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

Met Pro Asp Asp Ile Ile Asp Ser Phe Trp Tyr Ile Ile Asp His Phe
1 5 10 15

Leu Lys Asn Val Phe Glu Leu Glu Glu Glu Leu Glu Phe Gln Leu Leu

20 25 30

Asn Asn Gln Gly Lys Ile Thr Phe His Phe Ser Ser Gln His Leu Pro

35 40 45

Thr Ala Ile Asp Phe Asp Phe Asn His Pro Phe Asp Pro Arg Tyr Pro

50 55 60

Pro Arg Val Leu Val Leu Asp Met Asp Gly Arg Glu Thr Ile Leu Leu
65 70 75 80

Pro Glu Glu Asn Asp Leu Phe
85



(2) INFORMATION FOR SEQ ID NO:376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 



Met Gly Lys Lys Asp Ala Ser Ala Met His Glu Met Arg Ala Ser Phe 

1 5 10 15

Ile Gln Gly Ser Ile Glu Ala Gly His Thr Ala Glu Lys Ser Glu Gln

20 25 30

Val Phe Asp Val Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser

35 40 45

His Ala Tyr Ala Tyr Ser Ala Leu Ala Phe Gln Leu Ala Tyr Phe Lys

50 55 60

Thr His Tyr Pro Ala Ile Phe Tyr Gln Val Met Leu Asn Tyr Ser Asn
65 70 75 80

Ser Asp Tyr Leu Ile Asp Ala Leu Glu Ala Gly Phe Glu Val Ala Ser

85 90 95

Leu Ser Ile Asn Thr Ile Pro Tyr His Asp Lys Ile Ala Asn Lys Ser

100 105 110

Ile Tyr Ile Gly Leu Lys Ser Ile Lys Gly Leu Gln Gln Gly Leu Gly

115 120 125

Ala Leu Asp Tyr 
130 



(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 



Met Asp Lys Lys Gln Asn Leu Thr Ser Phe Gln Glu Leu Thr Thr Thr

1 5 10 15

Glu Leu Asn Gln Ile Thr Gly Gly Glu Trp Trp Glu Glu Leu Leu His

20 25 30

Glu Thr Ile Leu Ser Lys Phe Lys Ile Thr Lys Ala Leu Glu Leu Pro

35 40 45

Ile Gln Leu
50



(2) INFORMATION FOR SEQ ID NO: 378: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

Met His Glu Met Phe Thr Ala Ile Ala Glu Ser Asp Met Lys Lys Ala

1 5 10 15

Ala Ala Ile Gln Arg Lys Phe Ile Pro Lys Val Asn Ala Leu Phe Ser

20 25 30

Tyr Pro Ser Pro Ala Pro Val Lys Ala Ile Leu Asn Tyr Met Gly Phe

35 40 45

Glu Ala Gly Pro Thr Arg Leu Pro Leu Val Pro Ala Pro Glu Glu Asp

50 55 60

Val Lys Arg Ile Ile Lys Val Val Val Asp Gly Asp Tyr Glu Ala Thr
65 70 75 80

Lys Ala Thr Val Thr Gly Val Leu Arg Pro Asp Tyr
85 90

(2) INFORMATION FOR SEQ ID NO: 379:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379:

Met Leu Asn Glu Phe Pro Ile Phe Asp Tyr Glu Asp Ile Gln Leu Ile

1 5 10 15

Pro Asn Lys Cys Val Ile Lys Ser Arg Ala Glu Ala Asp Thr Ser Val

20 25 30

Thr Leu Gly Asn His Thr Phe Lys Leu Pro Val Val Pro Ala Asn Met

35 40 45

Gln Thr Ile Leu Asp Glu Asn Val Ala Glu Gln Leu Ala Lys Gly Gly

50 55 60 

Tyr Leu Tyr Thr Tyr Ala Pro Phe 
65 70 

(2) INFORMATION FOR SEQ ID NO: 380: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 133 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

Met Asp Ile Lys Leu Lys Asp Phe Glu Gly Pro Leu Asp Leu Leu Leu

1 5 10 15

His Leu Val Ser Lys Tyr Gln Met Asp Ile Tyr Asp Val Pro Ile Thr

20 25 30

Glu Val Ile Glu Gln Tyr Leu Ala Tyr Val Ser Thr Leu Gln Ala Met

35 40 45

Arg Leu Glu Val Thr Gly Glu Tyr Met Val Met Ala Ser Gln Leu Met
50 55 60

Leu Ile Lys Ser Arg Lys Leu Leu Pro Lys Val Ala Glu Val Thr Asp
65 70 75 80

Leu Gly Asp Asp Leu Glu Gln Asp Leu Leu Ser Gln Ile Glu Glu Tyr

85 90 95

Arg Lys Phe Lys Leu Leu Gly Glu His Leu Glu Ala Lys His Gln Glu

100 105 110

Thr Gly Pro Val Leu Phe Gln Ser Ala Asp Arg Val Asp Leu Arg Arg

115 120 125 

Cys Gly Ala Cys Ala 
130 

(2) INFORMATION FOR SEQ ID NO: 381:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 



Met Gln Ser Thr Glu Lys Lys Pro Leu Thr Ala Phe Thr Xaa Ile Ser
1 5 10 15

Thr Ile Ile Leu Leu Leu Leu Thr Val Leu Xaa Ile Phe Pro Phe Tyr
20 25 30

Trp Ile Leu Thr Gly Ala Phe Lys Ser Gln Pro Asp Thr Ile Val Ile
35 40 45

Pro Pro Gln Trp Phe Pro Lys Met Pro Thr Met Glu Asn Phe Gln Gln
50 55 60

Leu Met Val Gln Asn Pro Ala Leu Gln Trp Met Trp Asn Ser Val Phe
65 70 75 80

Ile Ser Leu Val Thr Met Phe Leu Val Cys Ala Thr Ser Ser Leu Ala
85 90 95

Gly Tyr Val Leu Ala Xaa Lys Arg Phe Tyr Gly Xaa Arg Ile Tyr Cys
100 105 110



(2) INFORMATION FOR SEQ ID NO: 382:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:

Met Leu Phe Met Arg Asp Ser Leu Asp Ser Ile Val Glu Pro Val Leu
1 5 10 15

Asp Glu Met Gly Arg Phe Tyr Asp Trp Thr Glu Glu Glu Lys Ala Thr
20 25 30

Tyr Arg Ala Asp Val Lys Ala Ala Leu Ala Gln Asn Asp Leu Ala Glu
35 40 45

Leu Lys Asn
50

(2) INFORMATION FOR SEQ ID NO:383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

Met Lys Ile Lys Val Val Thr Val Gly Lys Leu Lys Glu Lys Tyr Leu
1 5 10 15

Lys Asp Gly Ile Ala Glu Tyr Ser Lys Arg Ile Ser Arg Phe Ala Lys
20 25 30

Phe Glu Met Ile Glu Leu Ser Asp Glu Lys Thr Pro Asp Lys Ala Ser
35 40 45

Glu Ser Glu Asn Gln Lys Ile Leu Glu Ile Glu Gly Gln Arg Ile Leu
50 55 60

Ser Lys Ile Ala Asp Arg Asp Phe Val Ile Val Leu Ala Ile Glu Gly
65 70 75 80

Lys Thr Phe Phe Ser Glu Glu Phe Ser Lys Gln Leu Glu Glu Thr Ser
85 90 95

Ile Lys Gly Xaa Ser Thr Leu Thr Phe Ile Ile Gly Gly Ser Leu Gly
100 105 110

Leu Ser Ser Ser Val Lys Asn Arg Ala Asn Leu Ser Val Ser Phe Gly
115 120 125

Arg Leu Thr Leu Pro His Gln Leu Met Arg Leu Val Leu Val Glu Gln
130 135 140

Ile Tyr Arg Ala Phe Thr Ile Gln Gln Gly Phe Pro Tyr His Lys
145 150 155



(2) INFORMATION FOR SEQ ID NO: 384:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 



Met Gly Ile Leu Ala Trp Cys Asp Leu Val Leu Tyr Arg Leu Trp Asp
1 5 10 15

Ser Phe Met Asp Leu Gly Leu Phe Ile Asn Asp Ala Trp Val Arg Lys
20 25 30

Lys Lys Thr Leu Asn Lys Glu Arg Lys Lys Ala Gly Lys Ala Ala Leu
35 40 45

Pro Glu Asn Arg Trp Ile Gln Leu Leu Gly Met Val Val Thr Phe His
50 55 60

Val Val Met Leu Ser Phe Leu Ile Phe Ser Gly Phe Leu Asn Asn Leu
65 70 75 80

Trp Phe Lys Lys



(2) INFORMATION FOR SEQ ID NO: 385: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 



Met Val Met Ala Glu Asp Gln Ala Val Arg Gln Asn Arg Leu Ala Ile
1 5 10 15

Leu Ser Gln Leu Thr Lys Lys Ala Ala Lys Phe Ala Cys Phe Asn Gln
20 25 30

Ile Asn Thr Lys
35



(2) INFORMATION FOR SEQ ID NO: 386: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 



Met Gly Ala Gln Pro Val Gln Asp Thr Glu Thr Ser Ser Ala Leu Ile
1 5 10 15

Ser Ser His Tyr Leu Asp Glu Gln Asp Leu Ser Glu Lys Leu Lys Ser
20 25 30

Glu Leu Gln Trp Phe Glu Leu Glu Asn Lys Leu Leu Asn Leu Trp Glu
35 40 45

His



(2) INFORMATION FOR SEQ ID NO:387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

Met Asp Glu Val Thr Ile Phe Gly Ile Asn Tyr Phe Lys Glu His Tyr
1 5 10 15

Pro Glu Lys Leu Ala Glu Arg Phe Lys Gln Met Lys Ile Glu Glu Glu
20 25 30

Ala Pro Val Ile Ile Met Asp Met Thr Arg Ala Leu Gly Phe Arg Asp
35 40 45

Asp Tyr Asp Arg Phe Tyr Ser Leu Phe Arg Glu Gly Ser Pro
50 55 60



(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

Met Lys Val Ile Asn Gln Thr Leu Leu Glu Lys Val Ile Ile Glu Arg
1 5 10 15

Ser Arg Ser Ser His Lys Gly Asp Tyr Gly Xaa Leu Leu Leu Leu Gly 

20 25 30 

Gly Thr Tyr Pro Tyr Gly Val Xaa Ser Ser Trp Leu Leu 
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 389:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

Met Asn Trp Ile Leu Leu Gln Gly Leu Ile Cys Gln Arg Asp Ala Ser
1 5 10 15

Tyr Asp Met Lys Gln Asp Asp Leu Asp Lys Val Ala Asp Tyr Leu Phe
20 25 30

Lys Thr Glu Glu Trp Thr Met Tyr Glu Leu Ile Leu Phe Gly Asn Leu
35 40 45

Tyr Ser Phe Tyr Asp Val Asp Tyr Val Thr Arg Ile Gly Arg Glu Val
50 55 60

Met Glu Arg Glu Glu Phe Tyr Gln Glu Ile Ser Arg His Lys Arg Leu
65 70 75 80

Val Leu Ile Leu Ala Leu Asn Cys Tyr Gln His Cys Leu Glu His Ser
85 90 95

Ser Phe Tyr Asn Ala Asn Tyr Phe Glu Ala Tyr Thr Glu Lys Ile Ile
100 105 110

Asp Lys Gly Ile Lys Leu Tyr Glu Arg Asn Val Phe His Tyr Leu Lys
115 120 125

Gly Phe Ala Leu Tyr Gln Lys Gly Gln Cys Lys Glu Gly Cys Lys Gln
130 135 140

Met Gln Glu Thr Met His Ile Phe Asp Val Leu Gly Leu Pro Glu Gln
145 150 155 160

Val Ser Leu Leu Ser Gly Thr Leu Arg Lys Ile Cys Gln Lys Leu Ile
165 170 175

Phe Pro Asn Lys Gly Lys Asn Lys Lys Leu Leu Ser Val Leu Ile Gln
180 185 190

(2) INFORMATION FOR SEQ ID NO: 390:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:

Met Ser Thr Val Leu Ser Trp Thr Ala Tyr Lys Thr Gln Arg Leu Glu
1 5 10 15

Met Ser Ile Leu Leu His Met Ile Val Asn Gly Ile Ala Phe Cys Leu
20 25 30

Leu Ala Leu Val Val Ile Met Ser Arg Thr Leu Gly Ile Ser Val
35 40 45

(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:

Met Arg Glu Asn Asp Leu Leu Leu Ile Thr Ala Asp His Gly Asn Asp
1 5 10 15

Pro Thr Tyr Ala Gly Thr Asp His Thr Arg Glu Tyr Ile Pro Leu Leu
20 25 30

Ala Tyr Ser Pro Ala Phe Lys Gly Asn Gly Leu Ile Pro Val Gly His
35 40 45

Phe Ala Asp Ile Ser Ala Thr Val Ala Asp Asn Phe Gly Val Glu Thr
50 55 60

Ala Met Ile Gly Glu Ser Phe Leu Asp Lys Leu Val
65 70 75

(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 

Met Lys Lys Ile Phe Leu Thr Leu Leu Thr Val Ser Leu Leu Gly Gly
1 5 10 15

Ala Ser Thr Ala Val Ala Gln Asp Phe Thr Ile Ala Ala Lys His Ala
20 25 30

Ile Ala Val Glu Ala Asn Thr Gly Lys Ile Leu Tyr Glu Lys Asp Ala
35 40 45

Thr Pro Pro Val Glu Ile Ala Ser Ile Thr Lys Leu Ile Thr Val Tyr
50 55 60

Leu Val Tyr Glu Ala Leu Glu Asn Gly Ser Ile Thr Leu Ser Thr Pro
65 70 75 80

Val Asp Ile Ser Asp Tyr Pro Tyr Gln Leu Thr Thr Asn Ser Glu Ala
85 90 95

Ser Asn Ile Pro Met Glu Ala Arg Asn Tyr Thr Val Glu Glu Leu Leu
100 105 110

Glu Ala Thr Leu Val Ser Ser Ala Asn Ser Ala Ala Ile Ala Leu Ala
115 120 125

Glu Lys Ile Ala Gly Ser Glu Lys Asp Phe Val Xaa Met Met Arg Ala
130 135 140

Lys Leu Leu Glu Trp Gly Ile Arg Ile His Cys Cys Gln Tyr Asp Arg
145 150 155 160

Ser



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393:

Met Ser Asn Ala Met Glu Gly Val Leu Tyr Phe Leu Lys Pro Asp Phe
1 5 10 15

Ser Lys Leu Thr Ser Ala Asp Leu Leu Tyr Ala Leu Gly Gln Ser Phe
20 25 30

Phe Ala Leu Ser Leu Gly Val Thr Asp Met Leu Thr Tyr Ala Ser Tyr
35 40 45

Leu Asp Lys Lys Thr Asn Leu Val Gln Ser Gly Ile Ser Ile Val Thr
50 55 60

Met Asn Ile Ser Ile Val His His Gly Arg Ser Ser His Phe Pro Ser
65 70 75 80

His Val Ser Leu Gln Tyr Pro Leu
85

(2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:394: 

Met Lys Val Ala Lys Pro Phe Trp Ala Ile Val Lys Leu Lys Ser Val
1 5 10 15

Gln His Ala Ser Met Leu Glu Asn Pro Lys Glu Met Asp Gly Leu Met
20 25 30

Lys Gln Val Glu Asn Leu Ala Leu Glu Asn Gln Gly Tyr Gln Val Glu
35 40 45

Lys Glu Asn Lys Ala Phe Glu Gln Ile Lys Asp Ser Val Ala Thr Phe
50 55 60

Gln Thr Phe Leu Thr Ile Phe Leu Tyr Gly Cys
65 70 75



(2) INFORMATION FOR SEQ ID NO: 395:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 



Met Asp Ala Glu Val Ser Lys Asn Leu Arg Leu Ile Leu Glu Arg Lys
1 5 10 15

Gly Met Thr Ile Leu Thr Gly Thr Lys Leu Gln Glu Ile Ile Glu Glu
20 25 30

Asn Gly Gln Leu Arg Ile Lys Val Glu Gly Lys Asp Asn Ile Ile Ala
35 40 45

Ser Lys Ala Leu Leu Ser Ile Gly Arg Met Pro Asp Leu Glu Gly Ile
50 55 60

Gly Glu Val Glu Phe Glu Leu Asp Arg Gly Cys Ile Lys Val Asn Glu
65 70 75 80

Tyr Met Glu Thr Ser Val Pro Arg Ile Tyr Ala Thr Arg
85 90



(2) INFORMATION FOR SEQ ID NO: 396:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396:

Met Lys Asn Ser Ile Met Asp Thr Lys Phe Asp Arg Arg Ile Leu Leu
1 5 10 15

Leu Asn Lys Ile Ile Ile Val Phe Ile Val Leu Met Thr Leu Leu Pro
20 25 30

Leu Leu Tyr Ile Val Val Ala Ser Phe Met Asp Pro Lys Val Leu Val
35 40 45

Ser Arg Gly Ile Ser Phe Asn Pro Ala Asp Trp Thr Val Glu Gly Tyr
50 55 60

Gln Arg Val Ile Gln
65



(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 



Met Val Ile Pro Glu Ala Gly Tyr Leu Tyr Ala Phe His Tyr Pro Asn
1 5 10 15

Leu Lys Gly Lys Gly Lys Glu Ala Val Gln Val Ile Tyr Asn Leu Asn
20 25 30

Leu Ala Ser Ala Lys Val Ile Gln Leu Tyr Arg Ser Leu Gly Leu Asp
35 40 45

Gly Lys Ile Gly Ile Ile Leu Asn Leu Thr Pro Ala Tyr Pro Arg Ser
50 55 60

Asn Ser Pro Glu Asp Leu Glu Ala Ser Arg Phe Thr Asp Asp Phe Phe
65 70 75 80

Asn Lys Val Phe Leu Glu Ser Ser Cys
85



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 



Met Thr Arg Ile Ala Ser Val Ser Val Ile Arg Asn Ala Ala Phe Ile
1 5 10 15

Ala Ile Ala Leu Ser Phe Leu Gly Lys Phe Thr Ala Leu Ile Ser Thr
20 25 30

Ile Pro Asn Ala Val Leu Gly Gly Met Ser Ile Leu Leu Tyr Gly Val
35 40 45

Ile Ala Ser Asn Gly Leu Lys Val Leu Ile Lys Glu Arg Val Asp Phe
50 55 60

Ala Gln Met Arg Asn Leu Ile Ile Ala Ser Ala Met Leu Val Leu Gly
65 70 75 80

Leu Gly Arg Ser Tyr Pro
85



(2) INFORMATION FOR SEQ ID NO: 399: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:399: 



Met Ser Leu Arg Glu Lys Ser Met Ser Glu Tyr Lys Leu Ser Glu Asn
1 5 10 15

Asn Trp Thr Arg Val Ala Val Phe Ala Gly Gly Asn Arg Gly His Tyr
20 25 30

Arg Thr Asp Phe Asp Ala Phe Val Gly Val Asp Arg Gly Ser Leu Trp
35 40 45

Val Leu Glu Glu Asp Leu Pro Leu Ala Leu Ala Val Gly Asp Phe Asp
50 55 60

Ser Val Thr Glu Glu Glu Arg Gln Val Ile Gln Lys Arg Ala Gln Tyr
65 70 75 80

Phe Val Gln Ala Arg Pro Glu Lys Asp Asp Thr Asp Leu Glu Leu Ala
85 90 95

Leu Leu Thr Ile Phe Glu Gln Asn Pro Gln Ala Glu Val Thr Ile Phe
100 105 110

Gly Ala Leu Gly Gly Arg Ile Asp His Met Leu Ala Asn Val Leu Ser
115 120 125

Thr



(2) INFORMATION FOR SEQ ID NO: 400:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:

Met Val Leu Gln Arg Asn Glu Ile Asn Glu Lys Asp Thr Trp Asp Leu
1 5 10 15

Ser Thr Ile Tyr Pro Thr Asp Gln Ala Trp Glu Glu Ala Leu Lys Asp
20 25 30

Leu Thr Glu Gln Leu Glu Thr Val Ala Gln Tyr Glu Gly His Leu Leu
35 40 45

Asp Ser Ala Asp Asn Leu Leu Val Asn His
50 55



(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:

Met Ile Tyr Gly Ala Ile Ile Asp Ala Lys Thr Ala Glu Asn Ala Ala
1 5 10 15

Gly Met Thr Ala Met Gln Thr Ala Thr Asp Asn Ala Lys Lys Val Ile
20 25 30

Asn Asp Leu Thr Ile Gln Tyr Asn Arg Ala Gln Thr Gly Gly Tyr Tyr
35 40 45

Thr Arg Asn Tyr Arg Asn Arg Ser Ser Ala
50 55



(2) INFORMATION FOR SEQ ID NO:402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

Met Thr Asp Gly Tyr Glu His Tyr Lys Lys Ile Met Glu Asn Pro Pro
1 5 10 15

Lys Asn Leu Ile Phe Pro Gly Ile Val Ser Pro Glu Arg Met Arg Glu
20 25 30

Ile Val Cys Ser Asn Gly Ser Phe Leu Val Ala
35 40



(2) INFORMATION FOR SEQ ID NO:403: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 

Met Ser Lys Gln Lys Lys Phe Glu Glu Asn Leu Ala Glu Leu Glu Thr
1 5 10 15

Ile Val Gln Ser Leu Glu Asn Gly Glu Ile Ala Leu Glu Asp Ala Ile
20 25 30

Thr Ala Phe Gln Lys Gly Met Val Leu Ser Lys Glu Leu Gln Ala Thr
35 40 45

Leu Asp Lys Ala Glu Lys Thr Leu Val Lys Val Met Gln Glu Asp Gly
50 55 60

Thr Glu Ser Asp Phe Glu
65 70

(2) INFORMATION FOR SEQ ID NO: 404:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:404: 



Met Phe Ala Ala Val Thr Ala Gly Tyr Gln Ala Ala Leu Met Val Pro
1 5 10 15

Thr Glu Ile Leu Ala Glu Gln His Phe Glu Ser Leu Gln Asn Leu Phe
20 25 30

Pro Asn Leu Lys Leu Ala Leu Leu Thr Gly Ser Leu Lys Ala Ala Glu
35 40 45

Lys Arg Glu Val Leu Glu Thr Ile Ala Lys Gly Glu Ala Asp Leu Ile
50 55 60

Ile Gly Thr His Ala Leu Ile Gln Asp Gly Val Glu Tyr Ala Arg Leu
65 70 75 80

Gly Leu Ile Ile Ile Asp Glu Gln His Arg Phe Gly Val Gly Gln Arg
85 90 95

Arg Ile Leu Arg Glu Lys Gly Asp Asn Pro Asp Val Leu Met Met Thr
100 105 110

Ala Thr Pro Ile Pro Arg Thr Leu Ala Ile Thr Ala Phe Gly Asp Met
115 120 125

Asp Val Ser Ile Ile Asp Gln Met Pro Ala Gly Arg Lys Pro Tyr Cys
130 135 140

Asp Ala Leu Asp Gln Thr
145 150



(2) INFORMATION FOR SEQ ID NO:405: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405:

Met Asn Arg Glu Ala Leu Arg Leu Tyr Leu Val Thr Asn Arg Tyr Gln
1 5 10 15

Asp Ser Val Glu Ser Phe Leu Ala Lys Val Glu Thr Ala Cys Arg Ser
20 25 30

Gly Val Thr Ile Val Gln Leu Arg Glu Lys Asn Leu Thr Thr Asn Gln
35 40 45

Tyr Tyr Gln Leu Ala Lys Gln Val Lys Glu Ile Thr Asp Ala Tyr Gln
50 55 60

Val Pro Leu Ile Ile Asp Asp Arg Leu Asp Val Cys Leu Ala Val Asp
65 70 75 80

Ala Ala Gly Leu His Ile Gly Asp Asp Glu Leu Pro Val Ser Val Ala
85 90 95

Arg Gln Val Leu Gly Pro Glu Lys Ile Pro Arg Cys His Arg
100 105 110



(2) INFORMATION FOR SEQ ID NO: 406: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 98 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:

Met Gln Gly Ser His Phe His Leu Pro Ile Tyr Arg Met Thr Ser Gln
1 5 10 15

Ala Leu Leu Asp Glu Thr Lys Lys Val Ala Ile Pro Val Leu Ala Thr
20 25 30

Thr Gln Ser Lys Asp Ser Val Asp Tyr Arg Glu Leu Pro Ser Ile Glu
35 40 45

Asn Phe Val Leu Val Met Gly Asn Glu Gly Gln Gly Ile Ser Pro Leu
50 55 60

Met Ala Glu Ser Ala Asp Gln Leu Val His Ile Ser Met Lys Gly Gln
65 70 75 80

Ala Glu Ser Leu Asn Val Ala Val Ala Ala Gly Ile Leu Ile Phe His
85 90 95

Leu Ser

(2) INFORMATION FOR SEQ ID NO: 407:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 

Met Thr Lys Thr Ala Phe Leu Phe Ala Gly Gln Gly Ala Gln Tyr Leu
1 5 10 15

Arg Met Gly Arg Asp Phe Tyr Asp Gln Tyr Pro Ile Val Lys Glu Thr
20 25 30

Ile Asp Arg Ala Ser Gln Val Leu Gly Tyr Asp Leu Arg Tyr Leu Ile
35 40 45

Asp Thr Glu Glu Asp Lys Leu Asn Gln Thr Arg Tyr Thr Gln Pro Ala
50 55 60

Ile Leu Ala Thr Ser Val Ala Ile Tyr Arg Leu Leu Gln Glu Lys Gly
65 70 75 80

Tyr His Pro Asp Met Val Ala Gly Leu Ser Leu Gly Glu Tyr Ser Ala
85 90 95

Leu Val Ala Ser Gly Ala Leu Asp Phe Glu Asp Ala Val Ala Leu Val
100 105 110

Ala Lys Arg Gly Ala Tyr Met Glu Glu Ala Ala Pro Ala Asp Ser Gly 

115 120 125 

Lys Met Val Ala Val Leu His Thr Pro 
130 135 

(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408:

Met Tyr Gln Phe Ser Gly Gln Thr Lys Val Xaa Glu Val Leu Ala Phe
1 5 10 15

Arg Asp Lys Pro Pro Tyr Gly Gly Ser Ser Ala Met Pro Leu Arg Cys 
20 25 30 

Leu 



(2) INFORMATION FOR SEQ ID NO: 409: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 



Met Thr Lys Glu Thr Gly Met Glu Gln Thr Phe Phe Ile Ile Lys Pro
1 5 10 15

Asp Gly Val Lys Arg Gly Leu Val Gly Glu Val Leu Lys Arg Ile Glu
20 25 30

Gln Arg Gly Phe Thr Ile Glu Lys Leu Glu Phe Arg Ser Gln Val Phe
35 40 45

Arg Arg Val Asp
50



(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410:

Met Thr Ser Gly Pro Val Leu Val Gly Val Ile Ser Gly Pro Lys Val
1 5 10 15

Ile Glu Thr Trp Arg Thr Met Met Gly Ala Thr Arg Pro Glu Glu Ala
20 25 30

Leu Pro Gly Thr Ile Arg Gly Asp Phe Ala Lys Ala Ala Gly Glu Asn
35 40 45

Glu Ile Ile Gln Asn Val Val His Gly Ser Asp Ser Glu Lys Asn Gln
50 55 60

Leu Ser Arg Glu Ile Ala Pro Leu Val Leu Arg Val Asp Trp Leu Asn
65 70 75 80

Gln Leu Asp Lys Ser Ser Phe Glu
85



(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 



Met Gly Leu Lys Asp Asn Ala Gly Leu Val Lys Glu Tyr Ala Leu Val 

1 5 10 15

Asp Ala Val Glu Tyr Gln Asn Val Ile Val Ala Thr Thr Val Glu Glu
20 25 30

Met Leu Ser Lys Tyr Ala Asn Lys Asn Asp Leu Glu Ile Asp Asn Ala
35 40 45

Thr Thr Glu Ser Ile Lys Gly Val Val Ala Asp Leu Lys Ser Ala Val
50 55 60

Ile Lys Gly Asp Thr Val Tyr Phe Phe Lys Val Asp Gly Lys Ile Tyr
65 70 75 80

Lys Val Lys Ala Ser Val Ser Asp Asp Leu Pro Tyr Leu Glu Asn Gly
85 90 95

Lys Thr Phe Glu Gly Gln Val Xaa Lys Asp Asn Tyr Leu Lys Thr Phe
100 105 110

Lys Val Gln
115



(2) INFORMATION FOR SEQ ID NO: 412: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412:

Met Ile Ser Gly Ser Glu Ile Arg Asp Ile Val Thr Ser Asp Ile Pro
1 5 10 15

Leu Ala Asp Lys Thr Glu Thr Leu Val Arg Phe Ala Asn Asn Ala Gly
20 25 30

Gly Leu Asp Asn Ile Thr Val Ala Leu Val Ser Met Asn Glu Glu Asp
35 40 45

Glu Glu
50



(2) INFORMATION FOR SEQ ID NO: 413: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:

Met Pro Asp Asn Leu Ala Leu Arg Met Arg Pro Lys Thr Ile Asp Gln
1 5 10 15

Val Ile Gly Gln Glu Xaa Leu Val Gly Pro Gly Lys Ile Ile Arg Arg
20 25 30

Met Val Glu Ala Asn Arg Leu Ser Ser Met Ile Leu Tyr Gly Pro Pro
35 40 45

Gly Ile Gly Lys Thr Ser Ile Ala Ser Ala Ile Ala Gly Thr Thr Lys
50 55 60

Tyr Ala Phe Arg Thr Phe Asn Ala Thr Val Asp Ser
65 70 75



(2) INFORMATION FOR SEQ ID NO: 414: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

Met Pro Glu Glu Leu Ala Tyr Leu Val Gln His Leu Asp Asn Ala Gln
1 5 10 15

Glu Gln Val Val Leu Gly Asn Thr Tyr His Thr Gly Asn His Cys Phe
20 25 30

Ser 



(2) INFORMATION FOR SEQ ID NO:415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 



Met Val Phe Trp Glu Gly Leu Val Pro Thr Pro Asp Val Ile Asp Ala
1 5 10 15

Gly His Val Thr Gly Phe Leu Tyr Thr Val His Ser Ser Ser Thr Gly
20 25 30

Leu Ile Val Leu Gln Ile Lys Lys Asp Leu Leu Lys Leu
35 40 45

(2) INFORMATION FOR SEQ ID NO:416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

Met Thr Asp Arg Tyr Asp Ser Ser Leu Gly Ile Tyr Lys Val Pro Trp
1 5 10 15

Cys Val Ser Gln Gln Gln Thr Val Thr Glu Ile Met Asp Thr Tyr Cys
20 25 30

Asp Trp Gly Val Lys Tyr Pro Leu Val Tyr Leu Glu Asp Pro Phe Ser
35 40 45

Asp Glu Asp Leu Asp Ser Trp Arg Lys Phe Gln Leu Ile Lys Pro Leu
50 55 60

Lys Leu Gln Val Phe Gly Asp Asp Phe Tyr Ala Thr Asn Leu Glu Arg
65 70 75 80

Ile Ser Gln Phe Lys Asp Cys Ala Asp Gly Ile Val Ile Lys Pro Asn
85 90 95

Gln Val Gly Ser Val Ser Lys Thr Leu Glu Val Met Glu Tyr Ala Glu
100 105 110

Lys Ser Gly Ile Ser Met Ala Phe Ser Gln Arg Thr Ala Glu Thr Glu
115 120 125

Asn Asn Ile Ile Ser His Leu Ala Met Ser Val Ile Leu Leu Ile
130 135 140

(2) INFORMATION FOR SEQ ID NO: 417: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417: 

Met Lys Lys Ile Ser Lys Gln Glu Leu Val Asn Thr Arg Phe Ser Arg
1 5 10 15

Leu Phe Ala His Phe Gly Gln Glu Lys Asp Gly Ser Phe Leu Ala Gln
20 25 30

Arg Tyr Gln Phe Tyr Leu Ala Gln Gln Gly Gln Thr Leu Ser Gly Ala
35 40 45

His Asp Leu Leu Asp Ser Leu Ile Glu Arg Asp Tyr Asn Leu Tyr Ala
50 55 60

Ala Thr Asn Gly Ile Thr Ala Ile Gln Thr Gly Arg Leu Ala Gln Ser
65 70 75 80

Gly Leu Ala Pro Tyr Phe Asn Gln Val Phe Ile Ser Glu Gln Leu Gln
85 90 95

Thr Gln Lys Pro Asp Ala Leu Phe Tyr Glu Lys Ile Gly Gln Gln Ile
100 105 110

Ala Gly Phe Ser Lys Glu Lys Thr
115 120



(2) INFORMATION FOR SEQ ID NO: 418: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 

Met His Arg Lys Thr Val Ile Asp Xaa Arg Thr Leu Gly Glu Arg Tyr
1 5 10 15

Thr Phe Thr Xaa Pro Ile Lys Glu Leu Lys Thr Arg Asn Val Ala Glu
20 25 30

Val Ala Asp Leu Leu Ala Xaa Val Glu Ser Tyr Gln Glu Gln Asp Tyr
35 40 45

Tyr Val Val Gly Tyr Val Ser Tyr Glu Ala Ala Pro Ala Phe Glu Glu
50 55 60

Lys Leu Ala Val His Lys Val Pro Leu Leu Gly Glu Tyr Leu Leu Tyr
65 70 75 80

Phe Thr Val His Asp Arg Val Glu Thr Ser Pro Ile Pro Leu Thr Tyr
85 90 95

Glu Asp Ile Asp Leu Pro Ser Asn Trp Gln Gly Xaa Asn Val Cys Thr
100 105 110

Glu Leu 



(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 



Met Glu Arg Ile Glu Val Arg Ser Arg Ser Gly Ser Ala His Leu Gly
1 5 10 15

His Val Phe Thr Asp Gly Pro Arg Glu Leu Gly Gly Leu Arg Tyr Cys
20 25 30

Ile Asn Ser Ala Ser Leu Arg Phe Val Ala Lys Asp Glu Met Glu Lys
35 40 45

Ala Gly Tyr Gly Tyr Leu Leu Pro Tyr Leu Asn Lys
50 55 60



(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 



Met Glu Trp Ile Arg Leu Ile Gly Ile Ala Ile Ile Val Val Gly Phe
1 5 10 15

Ile Leu Lys Phe Asp Thr Ile Ala Thr Val Val Leu Ala Gly Leu Val
20 25 30

Thr Ala Leu Val Ser Gly Val Ser Leu Val Glu Phe Leu Glu Ile Leu
35 40 45

Gly Lys Glu Phe Ser Asn Gln Arg Val Leu Thr Ile Phe Met Val Thr
50 55 60

Leu Pro Leu Val Gly Leu Ser Glu Thr Phe Gly Leu Asn Asn Asp Gln
65 70 75 80

Ser Ile



(2) INFORMATION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 

Met Leu Leu Ile Phe Ala Val Ile Gly Val Leu Val Ala Leu Ile Ala
1 5 10 15

Gln Phe Tyr Ser Ala Lys Ala Ala Val Gly Phe Ala Lys Glu Leu Thr
20 25 30

Asn Asp Leu Tyr Arg His Ile Leu Ser Leu Pro Lys Asp Ser Arg Asp
35 40 45

Arg Leu Thr Thr Ser Ser Leu Val Thr Arg Leu Thr Ser Asp Thr Tyr
50 55 60

Gln Ile Gln Thr Gly Ile Asn Gln Phe Leu Arg Leu Phe Leu Arg Ala
65 70 75 80

Pro Ile Ile Val Phe Gly Ala Ile Phe Met Ala Tyr Arg Ile Ser Ala
85 90 95

Glu Leu Thr Phe Trp Phe Leu Val Met Val Ala Ile Leu Thr Ile Val
100 105 110

Ile Val Arg Val Ile Ser Ile Gly Gln Ser Ser Leu Gln
115 120 125



(2) INFORMATION FOR SEQ ID NO: 422: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

Met Pro Ile Glu Gln Val Gln Lys Leu Leu Gly His Ser Lys Ile Asp
1 5 10 15

Thr Thr Leu Ala Tyr Ala Met Val Asn Gln Asn Asn Val Lys His Ser
20 25 30

His Gln Lys Phe Ile Ser
35



(2) INFORMATION FOR SEQ ID NO: 423: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

Met Lys Pro Cys Trp Lys Lys Cys Phe Pro Glu Glu Val Ser Ser Glu
1 5 10 15

Gly Glu Val Thr Leu Ile Glu Ile Pro Val Ser Asp Lys Ile Ala Gly
20 25 30

Lys Gln Val His Glu Leu Asn Leu Pro His Asn Val Leu Ile Thr Thr
35 40 45

Gln Val His Asn Gly Lys Ser Gln Thr Val Asn Gly Ser Thr Arg Met
50 55 60

Tyr Leu Gly Asp Met Ile His Leu Val Ile Pro Lys Ser Glu Ile Gly
65 70 75 80

Lys Val Lys Asp Leu Leu Leu
85



(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 

Met Asn Glu Gly Val Glu Asn Phe Arg Ala Lys Leu Ala Ser Leu Gly
1 5 10 15

Ala Lys Asn Ile Gly Ile Tyr Val Gly Val Tyr Phe Met Glu Glu His
20 25 30

Ser Ile Asp Thr Gly Lys Phe Thr Ser Val Trp Ile Pro Ser Tyr Gly
35 40 45

Ser Asp Ser Gly Phe Leu Glu Ser Ser Pro Lys Thr Asp Leu Asp Tyr
50 55 60

Asp Ile His Gln Tyr Thr Ser Lys Gly Lys Leu Pro Ala Leu Thr Thr
65 70 75 80

Ile



(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:425: 

Met Val Ser Lys Tyr Leu Leu Leu Ala Val Ile Phe Ser Gly Leu Val
1 5 10 15

Thr Trp Ile Leu Arg Met Ile Ser Phe Ile Leu Val Lys Tyr Lys Gly
20 25 30

Leu Pro Ala Ile Val Glu Arg Phe Leu Lys Phe Leu Pro Val Ser Ile
35 40 45

Ile Phe Ala Leu Ile Leu Ser Ser Val Val Thr Gly Lys Val Gly Ser
50 55 60

Leu Pro Gln Ile Lys Trp Leu Asp Phe Leu Ala Val Phe Pro Thr Ala
65 70 75 80

Trp Val Ala Phe Arg Tyr Arg Asn Leu Leu Gly Thr Val Leu Phe Gly
85 90 95

Val Val Leu Ile Ala Ile Leu Arg Leu Val Ser
100 105

(2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 141 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

Met Glu Asp Xaa Glu Thr Gln Arg Val Val Met Gln Tyr Arg Ala Pro

1 5 10 15

Glu Asn Asn Arg Trp Ser Gly Tyr Ala Phe Pro Gly Gly His Val Glu

20 25 30

Asn Asp Glu Ala Phe Ala Glu Ser Val Ile Arg Glu Ile Tyr Glu Glu

35 40 45

Thr Gly Leu Thr Ile Gln Asn Pro Gln Leu Val Gly Ile Lys Asn Trp

50 55 60

Pro Leu Asp Thr Gly Gly Arg Tyr Ile Val Ile Cys Tyr Lys Ala Thr
65 70 75 80

Glu Phe Ser Gly Thr Leu Gln Ser Ser Glu Glu Gly Glu Val Ser Trp

85 90 95

Val Gln Lys Asp Gln Ile Pro Asn Leu Asn Leu Ala Tyr Asp Met Leu

100 105 110

Pro Leu Met Glu Met Met Glu Ala Pro Asp Lys Ser Glu Phe Phe Tyr

115 120 125

Pro Arg Arg Thr Glu Asp Asp Trp Glu Lys Lys Ile Phe
130 135 140



(2) INFORMATION FOR SEQ ID NO: 427: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:

Met Thr Ile Glu Leu Leu Thr Pro Phe Thr Lys Val Glu Leu Glu Pro

1 5 10 15

Glu Ile Lys Glu Lys Lys Arg Lys Gln Val Gly Ile Leu Gly Gly Asn

20 25 30

Phe Asn Pro Val His Asn Ala His Leu Ile Val Ala Asp Gln Val Arg

35 40 45

Gln Gln Leu Gly Leu Asp Gln Val Leu Leu Met Pro Glu Tyr Gln Pro

50 55 60

Pro His Val
65



(2) INFORMATION FOR SEQ ID NO: 428:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 

Met Lys Lys Leu Val Phe Val Cys Leu Gly Asn Ile Cys Arg Ser Pro

1 5 10 15

Met Ala Glu Phe Val Met Lys Ser Met Thr Asp Asn Tyr Glu Ile Gln

20 25 30

Ser Arg Ala Thr Ser Ser Trp Glu His Gly Asn Pro Ile His Lys Gly

35 40 45

Thr Gln Gly Ile Phe Gln Glu Tyr Glu Ile Pro Tyr Asp Lys Asn Lys

50 55 60

Thr Ser Leu Gln Ile Ser Lys Glu Asp Phe Glu Ala Phe Asp Tyr Ile
65 70 75 80

Ile Gly Met Asp Ala Ser Asn Val Pro Thr Tyr Val Arg Cys Val Gln
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 

Met Val Lys Tyr Gly Val Val Gly Ala Gly Tyr Phe Gly Ala Glu Leu 

1 5 10 15

Ala Arg Tyr Met Gln Lys Asn Asp Gly Ala Xaa Ile Thr Leu Leu Tyr

20 25 30

Asp Pro Asp Asn Ala Glu Ala Ile Ala Glu Glu Leu Gly Ala Lys Val

35 40 45 

Ser Lys Phe Leu Arg 
50

(2) INFORMATION FOR SEQ ID NO: 430:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:430: 

Met Leu Trp Lys Tyr Tyr Asn Ser Ser Phe Leu Lys Ala Val Pro His 

1 5 10 15

Leu Thr Thr Glu Tyr Lys Arg Leu Ala Gln Ala His Gly Leu Asn Leu

20 25 30

Lys Gln Ala Lys Pro Ile Thr Met Gly Met Trp Ile Gly Gly Asp Arg

35 40 45

Glu Gly Asn Pro Phe Val Thr Ala Lys Asn Leu Lys Gln Ser Ala Leu

50 55 60

Thr Gln Cys Glu Val Xaa Leu Asn Leu Leu
65 70 

(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 

Met Asn Thr Leu Tyr Leu Asn Gln His Ser Ser Tyr Lys Asn Asp Glu

1 5 10 15

Asp Ser His Ser Phe Pro Ile Gln Met Glu Leu Val Ser Asp Glu Met

20 25 30

Ile Pro Arg Asn
35 

(2) INFORMATION FOR SEQ ID NO: 432:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 

Met Ala Arg Lys Glu Thr Met Ala Glu Arg Lys Tyr Cys Lys Met Lys 

1 5 10 15

Lys Asp Ile His Pro Glu Tyr Arg Pro Val Val Phe Met Asp Thr Thr

20 25 30

Thr Gly Tyr Gln Phe Leu Ser Gly Ser Thr Lys Arg Ser Asn Glu Thr

35 40 45

Val Glu Phe Glu Gly Glu Thr Tyr Pro Leu Ile Arg Val Glu Ile Ser

50 55 60

Ser Asp Ser His Pro Phe Tyr Thr Gly Arg Gln Lys Phe Thr Gln Ala
65 70 75 80

Asp Gly Arg Val Asp Arg Phe Asn Lys Lys Tyr Gly Leu Lys 
85 90 

(2) INFORMATION FOR SEQ ID NO: 433: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

Met Glu Thr Ala Leu Ile Ser Val Ile Val Pro Val Tyr Asn Val Ala

1 5 10 15

Gln Tyr Leu Glu Lys Ser Ile Ala Ser Ile Gln Lys Gln Thr Tyr Gln

20 25 30

Asn Leu Glu Ile Ile Leu Val Asp Asp Gly Ala Thr Asp Glu Ser Gly

35 40 45

Arg Leu Cys Asp Ser Ile Ala Glu Gln Asp Asp Arg Val Ser Val Leu
50 55 60

His Lys Lys Asn Glu Gly Leu Ser Gln Ala Arg Asn Asp Gly Met Lys
65 70 75 80 

Pro Gly Ser Arg Gly Leu Ser Asp Phe Tyr 
85 90 

(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:

Met Tyr Phe Leu Ile Asn Phe Val Tyr Pro Val Asp Met Val Ile Asn

1 5 10 15

Leu Pro Phe Leu Ile Asn Thr Gly Leu Ile Val Leu Leu Ser Ala Ile

20 25 30

Ser Tyr Ile Ser Leu Leu Val Phe Thr Lys Asp Ser Ile Phe Tyr Glu

35 40 45 

Phe Leu Asn His Val Leu Ala Leu Lys Asn Lys Phe Lys Lys Ser 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:435: 

Met Arg Val Ile Ala Trp Leu Glu Pro Ser Arg Lys Ile Leu Gln Pro

1 5 10 15

Gly Ala Asn Asp Val Trp Val Val Lys Arg Lys Gly Lys Arg Asp Leu

20 25 30

Leu Leu Pro Tyr Ile Pro Pro Val Val Leu Asn Val Asp Ile Pro Asn
35 40 45

Lys Arg Val Asp Val Glu Ile Leu Glu Gly Leu Asp Asp Glu Asp
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436:

Met Asp Asp Asp Phe Lys Arg Tyr Asn Ala Pro Ile Leu Thr Trp Tyr

1 5 10 15

Glu Thr Ala Arg Tyr Ala Phe Glu Arg Gly Met Val Trp Gln Asn Leu

20 25 30

Gly Gly Val Glu Asn Ser Leu Asn Gly Gly Leu Tyr His Phe Lys Glu

35 40 45

Lys Phe Asn Pro Thr Ile Glu Glu Tyr Leu Gly Glu Phe Thr Met Pro

50 55 60

Thr His Pro Leu Tyr Pro Leu Leu Arg Leu Ala Leu Asp Phe Arg Lys

65 70 75 80

Thr Leu Arg Lys Lys His Arg Lys 

85 



(2) INFORMATION FOR SEQ ID NO:437: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437:

Met Glu Asp Phe Phe Ala Trp Cys Arg Arg Gln Ser Val Leu Ser Gly

1 5 10 15

Ser Lys Leu Gly Arg Ala Ile Glu Tyr Ser Leu Lys Tyr Glu Glu Thr
20 25 30

Phe Lys Thr Ile Leu Lys Asp Gly His Leu Val Leu Ser Asn Asn Leu

35 40 45

Ala Glu Arg Ala Ile Lys Ser Leu Val Met Gly Arg Ser Lys Arg Val

50 55 60

Gln Trp Thr Leu Leu Ala
65 70 



(2) INFORMATION FOR SEQ ID NO: 438: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:438:

Met Thr Lys Gln Gly Ser Trp Lys Leu Ile Leu Glu Asp Glu Ser Ala

1 5 10 15

Gly Arg Glu Leu Phe Pro Ile Leu Thr Gln Gly Xaa Tyr Ile Ala Thr

20 25 30

Phe Asp Gln Gln Ala Pro Xaa Ile Asp Glu Ile Phe Lys Leu
35 40 45



(2) INFORMATION FOR SEQ ID NO: 439: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:439: 



Met Thr Thr Asn Xaa Asp Asn Asp Thr Ser Glu Arg Ser Asp Cys Tyr

1 5 10 15

Arg Lys Arg Thr Ala Gly Glu Thr Pro Met Asn Asp Leu Thr Ser His

20 25 30

Thr His Gly Gly Asn Tyr Thr Ile Ala Arg Tyr Gln Glu Glu Lys Phe

35 40 45

Trp Asn Lys Gln Leu
50

(2) INFORMATION FOR SEQ ID NO: 440:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:440:

Met Ala Gln His Ala Ala Val Glu Ala Leu Thr Ala Gly Lys Asn Asp

1 5 10 15

Ala Glu Pro Met Lys Lys Glu Tyr Ile Gln Arg Arg Asp Tyr Ile Ile

20 25 30

Glu Lys Met Thr Ala Leu Gly Phe Glu Ile Ile Lys Pro Asp Gly Ala

35 40 45

Phe Tyr Ile Phe Ala Lys Ile Pro Ala Gly Tyr Asn Gln Asp Ser Phe

50 55 60

Ala Phe Leu Lys Asp Phe Ala Gln Lys Lys Ala Val Ala Phe Ile Pro
65 70 75 80

Gly Ala Ala Phe Gly Arg Tyr Gly Glu Gly Tyr Val Arg Leu Ser Tyr

85 90 95

Ala Ala Ser Met Glu Thr Ile Lys Glu Ala Met Lys Arg Leu Glu Glu

100 105 110

Tyr Met Arg Glu Ala 
115 

(2) INFORMATION FOR SEQ ID NO: 441: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:441:

Met Asn Ile Thr Asn Leu Phe Ser Ile Lys Thr Gly Cys Asp Glu Thr

1 5 10 15

Asp Arg Gln Leu Gln Lys Leu Phe Phe Gln Leu Asp Leu Gln Leu Gly

20 25 30

Glu Leu Thr Asp Gln Leu Arg Lys Leu Asp Ser Asn Phe Val Pro Arg

35 40 45

Ser Gln Phe Val Asp Thr Leu Asp Leu Asn Asp Val Glu Tyr Lys Glu

50 55 60

Ile Leu Asn Tyr Phe Ile Phe His Arg Asn Asp Ser Glu Glu Ser Leu
65 70 75 80

Val Glu Trp Leu Tyr Asp Trp Ile Ser Thr Asn Arg Tyr Glu Leu Pro
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 442: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442:

Met Leu Gln Trp Ile Asp Arg Thr Phe Gln Asp Tyr Leu Glu Leu Glu

1 5 10 15

His Leu Val Leu Thr Thr Trp Ser Gly Asn Leu Gly Met Met Lys Leu

20 25 30

Ala Glu Lys Leu Xaa Met Lys Lys Xaa Ala His Ile Xaa Lys Val Arg

35 40 45

Tyr Tyr Gln Gly Lys Tyr Phe Asp Ser Ile Lys Tyr Gly Ile Leu Arg

50 55 60

Xaa Asp Trp Glu Lys Ile Asn Asp Gly Tyr Tyr Gln Ile Tyr Xaa Asn
65 70 75 80 

Ser 



(2) INFORMATION FOR SEQ ID NO:443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

WO 97/37026 PCT/US97/05306

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

Met Ile Val Glu Glu Val Arg Asp Arg Tyr Val Gly Lys Val Asp Ala

1 5 10 15

Val Phe His Asn Gly Asp Ser Glu Leu Arg Pro Asp Ser Pro Leu Trp

20 25 30

Glu Gly Ile Arg Val Val Lys Gly Asn Met Asp Phe Tyr Ala Gly Tyr

35 40 45

Pro Glu Arg Leu Val Thr Glu Leu Gly Ser Thr Lys Ile Ile Gln Thr

50 55 60

His Gly His Leu Phe Asp Ile Asn Phe Asn Phe Gln Lys Leu Asp Tyr
65 70 75 80

Trp Ala Gln Glu Glu Glu Ala Ala Ile Cys Leu Tyr Gly His Leu His

85 90 95

Val Pro Ser Ala Trp Leu Glu Gly Lys Ile Leu Phe Leu Asn Pro Gly

100 105 110

Ser Ile Ser Gln Pro Arg Gly Thr Ile Arg Glu Cys Leu Tyr Ala Arg

115 120 125

Val Glu Ile Asp Asp Ser Tyr Phe Lys Val Asp Phe Leu Thr Arg Asp

130 135 140 

His Glu Val Tyr Pro Gly Cys Pro Arg Ser Leu Ala Asp Asp Ala Lys 
145 150 155 160 

Glu Leu Arg Leu Ser Val Gly Ala Glu Asn Phe 
165 170 

(2) INFORMATION FOR SEQ ID NO: 444: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 

Met Asn Leu His Gln Pro Leu His Val Leu Pro Gly Val Gly Pro Lys
1 5 10 15

Ser Ala Glu Lys Tyr Ala Lys Leu Gly Ile Glu Asn Leu Gln Asp Leu
20 25 30

Leu Leu Tyr Phe Pro Phe Arg Tyr Glu Asp Phe Lys Thr Lys Gln Val
35 40 45

Leu Glu Leu Glu Asp Gly Glu Lys Ala Val Leu Ser Gly Gln Val Val
50 55 60

Thr Pro Ala Ser Val Gln Tyr Tyr Gly Phe Gln Ala Ala Asn Pro Pro
65 70 75 80

Ala Phe Leu Val Leu Gln Ala Arg Glu Glu Gly Ser Phe Phe Phe Gly
85 90 95

Gly



(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:

Met Gln Leu Leu Ser Glu Asn Met Leu Lys Thr Ile Gln Ser Leu Ser

1 5 10 15

Val Trp Gln Ile Tyr Leu Leu Gly Phe Glu Arg Ile Leu Ala Leu Gly

20 25 30

Phe Gln Leu Leu Leu Thr Val Trp Val Tyr Gln Ala Val Arg Gln Lys

35 40 45

Lys Trp Ile Tyr Leu Leu Ala Ala Tyr Gly Leu His Ala Phe Phe Asp

50 55 60

Leu Ala Pro Ser Leu Phe Gln Val Gly Trp Leu Thr Asn Pro Val Leu
65 70 75 80

Val Glu Val Ile Leu Ala Leu Glu Leu Val Leu Val Ala Tyr Gly Thr

85 90 95

Lys Glu Ile Phe Cys Lys Lys Ser
100 

(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 122 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446:

Met Thr Lys Ser Met Thr Pro Asp Arg Glu Val Ile Thr Phe Ile Pro

1 5 10 15

Glu Lys Phe Ile Val Asp Gly Phe Gln Gly Ile Arg Asp Pro Arg Gly

20 25 30

Met Met Gly Val Arg Leu Glu Met Arg Gly Leu Leu Tyr Thr Gly Pro

35 40 45

Arg Thr Ile Leu His Asn Leu Arg Lys Thr Val Glu Arg Ala Gly Val

50 55 60

Gln Val Glu Asn Val Ile Ile Ser Pro Leu Ala Met Val Gln Ser Val
65 70 75 80

Leu Asn Glu Gly Glu Arg Glu Phe Gly Ala Thr Val Ile Asp Met Gly

85 90 95

Ala Gly Gln Thr Thr Val Ala Thr Ile Arg Asn Gln Glu Leu Gln Phe

100 105 110

Thr His Ile Leu Gln Glu Val Glu Ile Met
115 120 



(2) INFORMATION FOR SEQ ID NO: 447: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:447: 

Met Leu Asn Ser Ile Val Thr Ile Ile Cys Ile Ala Leu Ile Ala Phe

1 5 10 15

Ile Leu Phe Trp Phe Phe Lys Lys Pro Glu Lys Ser Gly Gln Lys Ala

20 25 30

Gln Gln Lys Asn Gly Tyr Gln Glu Ile Arg Val Glu Val Met Gly Gly
35 40 45

Tyr Thr Pro Glu Leu Ile Val Leu Lys Lys Ser Val Pro Ala Arg Ile

50 55 60

Val Phe Asp Arg Lys Asp Pro Ser Pro Cys Leu Asp Gln Ile Val Phe
65 70 75 80

Pro Asp Phe Gly Val His Ala Asn Leu Pro Met Gly Glu Glu Tyr Val

85 90 95

Val Glu Ile Thr Pro Glu Gln Ala Gly Glu Phe Gly Phe Ala Cys Gly

100 105 110

Met Asn Met Met His Gly Lys Met Ile Val Glu

115 120



(2) INFORMATION FOR SEQ ID NO:448: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 

Met Lys Ile Val Ser Gly Ile Tyr Gly Gly Arg Pro Leu Lys Thr Leu

1 5 10 15

Glu Gly Lys Thr Thr Arg Pro Thr Ser Asp Lys Val Arg Gly Ala Ile

20 25 30

Phe Asn Met Ile Gly Pro Tyr Phe Glu Val Gly Xaa Val Leu Asp Leu

35 40 45

Tyr Ala Gly Ser Gly Gly Leu Ser Ile Glu Ala Val Ser Arg Gly Met

50 55 60

Ser Ser Ala Val Leu Val Glu Arg Asp Arg Lys Ala Gln Thr Ile Val
65 70 75 80

Ala Glu Asn Ile Gln Met Thr Lys Glu Val Gly Lys Phe Gln Leu Leu

85 90 95

Gln Asn Gly Cys Arg Lys Gly Ile Gly Thr Gly Xaa Leu Gly Asn Leu

100 105 110

Thr Ser Phe Ser

115

(2) INFORMATION FOR SEQ ID NO:449:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 52 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 



Met Xaa Thr Asn Phe Thr Asp Gln Val Asp Thr Met Ile Tyr Val Asp

1 5 10 15

Lys Glu Glu Lys Glu Thr Ile Lys Ala Ala Leu Val Glu Phe Phe Asn

20 25 30

Gly Lys Val Thr Leu Thr Asp Gln Gly Leu Arg Glu Val Glu Val Pro

35 40 45

Val Asn Leu Val
50



(2) INFORMATION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 



Met Ile Gly Val Ala Leu Glu Glu Ser Pro Thr Leu Lys Ile His His

1 5 10 15

Leu Thr His Asp Ile Thr Pro Tyr Asn Ile Phe Glu Gly Ser Tyr Arg

20 25 30

Leu Phe Gln Thr Val Asp Tyr Trp Pro Glu Gly Thr Thr Phe Val Ser

35 40 45

Val Val Asp Pro Gly Val Gly Ser Lys Arg Lys Ser Val Val Ala Lys

50 55 60

Thr Ala Gln Lys Ser Ile His Cys His Ala Arg
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 451: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 130 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 



Met Glu Leu Arg Arg Pro Arg Leu Ala Asp Lys Lys Ala Val Leu Asp 

1 5 10 15 

Met Met Thr Glu Phe Glu Lys Ser Gln Ser Ala His Asp Gly Gly Phe

20 25 30

Trp Asp Thr Glu Asn Phe Val Tyr Glu Glu Trp Leu Glu Ser Asn Gln

35 40 45

Glu Gln Glu Met Gly Ile Asn Leu Pro Glu Gly Trp Val Pro Ala Ile

50 55 60

Gln Leu Val Ala Phe Ser Glu Lys Gly Gln Ala Val Gly Phe Leu Asn
65 70 75 80

Leu Arg Leu Arg Leu Ser Asn Phe Leu Leu Glu Glu Gly Gly His Ile

85 90 95

Gly Tyr Ser Ile Arg Pro Ser Glu Arg Gly Lys Gly Tyr Ala Lys Glu

100 105 110 

Leu Ser Val Arg Ala Cys Lys Leu Leu Arg Lys Arg Thr Ser Arg Lys 
115 120 125 

Leu Trp 
130 



(2) INFORMATION FOR SEQ ID NO: 452: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452: 



Met Thr Phe Ile Arg Gly Pro Ile Ile Ser Thr Val Gly Glu Gly Val

1 5 10 15

Xaa Ile Leu Ala Thr Val Asn Xaa Gln Phe Val Xaa Xaa Gln Xaa Xaa

20 25 30

Asn Met Leu Val Ser Pro Phe His Pro Glu Leu Thr Asp Xaa Xaa Arg

35 40 45

Leu Xaa Gln Tyr Phe Ile Ser Met Cys Lys Glu Lys Ser Xaa Asp
50 55 60



(2) INFORMATION FOR SEQ ID NO: 453: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:

Met Lys Gly Asp Val Ile Thr Glu His Gln Phe Tyr Glu Gln Val Lys

1 5 10 15

Asn Asn Pro Ser Ala Gln Xaa Val Leu Leu Asn Met Thr Ile Gln Lys

20 25 30

Val Phe Glu Lys Thr Ile Trp Leu Arg Ala
35 40 



(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:454: 

Met Asp Lys Ala Leu Ala Asp Leu Lys Thr Ser Gly His Leu Pro Ile

1 5 10 15

Pro Arg His Leu Arg Asp Gly His Tyr Xaa Gly Ser Lys Glu Leu Gly

20 25 30

Asn Ala Gln Asp Tyr Leu Tyr Pro His Asn Tyr Pro Xaa Asn Trp Val

35 40 45

Lys Gln Asp Tyr Leu Pro Gln Lys Ile Arg Asn His His Tyr Phe Gln
50 55 60

Ala Glu Tyr Thr Gly Lys Tyr Glu Arg Ala Leu Ala Gln Arg Lys Glu
65 70 75 80

Ala Ile Asp His Leu Arg Lys Ile
85 

(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

Met Val Val Phe Thr Gly Ser Thr Val Glu Glu Ala Ile Gln Lys Gly

1 5 10 15

Leu Lys Glu Leu Asp Ile Pro Arg Met Lys Ala His Ile Lys Val Ile

20 25 30

Ser Arg Glu Lys Lys Gly Phe Leu Gly Leu Phe Gly Lys Lys Pro Ala

35 40 45

Gln Val Asp Ile Glu Ala Ile Ser Glu Thr Thr Val Val Lys Ala Asn

50 55 60

Gln Gln Val Val Lys Gly Val Pro Lys Lys Ile Asn Asp Leu Asn Glu
65 70 75 80 

Pro Val Lys Thr Val Ser Val Arg Asn Arg 
85 90 

(2) INFORMATION FOR SEQ ID NO: 456:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 

Met Thr Lys Arg Cys Ser Trp Val Lys Met Thr Asn Pro Leu Tyr Ile 
1 5 10 15




Ala Tyr His Asp Glu Glu Trp Gly Gln Pro Leu His Asp Asp Gln Ala

20 25 30

Leu Phe Glu Leu Leu Cys Met Glu Thr Tyr Gln Ala Gly Leu Ser Trp

35 40 45

Glu Thr Val Leu Asn Lys Arg Gln Ala Phe Arg Glu Ala Phe Pro
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

Met Phe Val Gly Gln Glu Trp Thr Asn Gln Thr Phe Val Asp Leu Leu

1 5 10 15

Gly Asn His Gln Gly Gln Val Thr Ile Asp Glu Glu Gly Tyr Gly Gln

20 25 30

Phe Pro Val Ser Ala Arg Ser Val Ser Val Trp Ala Val Asn Thr Ile
35 40 45 

(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 

Met Ala Gln Val Asp Ile Ile His Gly Ile Gly Thr Gly Val Ile Arg

1 5 10 15

Glu Gly Val Thr Lys Tyr Leu Gln Arg Asn Lys His Val Lys Ser Phe

20 25 30

Gly Tyr Ala Pro Gln Asn Ala Gly Gly Ser Gly Ala Thr Ile Val Thr
35 40 45

(2) INFORMATION FOR SEQ ID NO: 459:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459: 



Met Asn Leu Leu Glu Glu Ala Thr Lys Gly Lys Val Ile Phe Glu Gly

1 5 10 15

Val Asp Ile Thr Asp Lys Lys Asn Asp Leu Phe Pro Met Arg Glu Lys

20 25 30

Met Gly Met Val Phe Gln Gln Phe Asn Xaa Phe Leu Ile
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 460: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460:

Met Gln Ile Asp Leu Ala Asn Leu Leu Pro Asp Leu Pro Gly Asn Leu

1 5 10 15

Ser Gly Ile Asn Pro Asn Arg Tyr Val Phe Tyr Gln Asp Val Leu Cys

20 25 30

Pro Ile Leu Asp Arg His Met Thr Pro Glu Gln Asp Lys Pro His Phe

35 40 45

Ala Gln Ala Ala Gly Thr Ile Ala Asp Ile Lys Glu Lys Ala Gly Asn

50 55 60

Tyr Ala Tyr Leu Phe Glu Thr Gln Ala Gln Leu Asn Ala Ile Leu Ser
65 70 75 80

Ser Lys Val Asp Val Gly Arg Arg Ile Arg His Ala Tyr Gln Ala Asp

85 90 95

Asp Lys Glu Ser Leu Gln Gln Ile Ala Arg Gln Glu Leu Pro Glu Leu

100 105 110

Arg Ser Gln Ile Glu Asp Phe His Ala Leu Phe Ser His His Trp Leu
115 120 125 

Lys 



(2) INFORMATION FOR SEQ ID NO:461: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461:

Met Arg Gln Gly Ile Asp Ser Leu Ala Tyr Val Val Lys Thr His Phe

1 5 10 15

Glu Leu Asp Pro Phe Ser Gly Gln Ala Phe Leu Phe Cys Gly Gly Arg

20 25 30

Lys Asp Arg Phe Lys Ala Leu Tyr Trp Asp Gly Gln Gly Phe Trp Leu

35 40 45

Leu Tyr Lys Arg Phe Glu Asn Gly Lys Leu Thr
50 55



(2) INFORMATION FOR SEQ ID NO: 462:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 108 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:

Met Asn Ala Phe Leu Glu Glu Leu Ser Gln Ala Tyr Pro Asp Asp Tyr
1 5 10 15

Phe Leu Leu Val Met Asp Asn Ala Ile Trp His Lys Ser Ser Val Leu
20 25 30

Lys Ile Pro Thr Asn Ile Gly Phe Ala Phe Ile Pro Pro Tyr Thr Pro
35 40 45

Glu Met Asn Pro Ile Glu Gln Val Trp Lys Glu Ile Arg Lys Arg Gly
50 55 60

Phe Lys Asn Lys Ala Phe Arg Ile Leu Glu Asp Val Met Asn Gln Leu
65 70 75 80

Gln Asp Val Ile Gln Gly Leu Glu Lys Glu Val Ile Lys Ser Ile Val
85 90 95

Asn Arg Arg Trp Thr Arg Met Leu Phe Glu Ser Arg
100 105

(2) INFORMATION FOR SEQ ID NO: 463:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 



Met Thr Val Glu Glu Glu Lys Val Phe Leu Ala Arg His Leu Lys Ala

1 5 10 15

Thr Glu Ala Gly Glu Phe Val Thr Ile Asp Ala Leu Phe Gln Ala Tyr

20 25 30

Lys Lys Glu Leu Gly Arg Ser Tyr Thr Arg Asp Ala Phe Tyr Gln Leu

35 40 45

Leu Lys Arg His Gly Trp Arg Asn Ile Met Pro Arg Pro Glu His Pro

50 55 60

Lys Lys Ala Asp Ala Gln Thr Ile Val Ala Ser Lys Asn Lys Ile Ser
65 70 75 80

Ile Gln Glu Asp Lys Lys Ala Ile
85



(2) INFORMATION FOR SEQ ID NO: 464: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464:

Met Leu Ser Leu Val Val Leu Ile Thr Gln Tyr Ala Thr His Gly Glu

1 5 10 15

Leu Phe Glu Asn Tyr Ile Ser Lys Glu Asn Asp Asn Val Ile Lys Glu

20 25 30

Tyr Gln Asn Thr Thr Gly Phe Ser Thr Pro Tyr Thr Leu Asp Gly Ser

35 40 45

Val Leu Ile Val His Pro Asp Leu Thr Lys Gly Met Asn Ile Glu Gly

50 55 60

Tyr Ser Asp Leu Leu Lys Pro Glu Leu Lys Gly Lys Ile Ala Thr Ala
65 70 75 80

Asp Pro Ala Asn Ser Ser Ser Ala Phe Ala Gln Leu Thr Asn Met Leu

85 90 95

Gln Ala Gln Gly Gly Tyr Lys Asp Asp Leu Ala Trp Ser Val Leu Ser

100 105 110

Thr Ile Phe Ser Thr Leu Ile Asp Gly Ile Val Lys
115 120

(2) INFORMATION FOR SEQ ID NO: 465:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465:

Met Leu Ala Arg Lys Asp Arg Phe Tyr Tyr Ile Asp Lys Ala Thr Gly

1 5 10 15

Lys Ala Leu Gly Ile Phe Leu Ser Cys Val Leu Thr Ser Asn Gly Val

20 25 30

Ile Glu Val Gly Ala Val Thr Phe Ser Pro Lys Leu Arg Gly Thr Arg
35 40 45

Ile Gly Thr Glu Ala Gln Tyr Leu Leu Ala Arg Tyr Val Phe Glu Glu

50 55 60

Leu Asn Tyr Arg Arg Tyr Glu Trp Lys Cys Asp Ala Leu Asn Leu Pro
65 70 75 80

Ser Arg Arg Ala Ala Glu Arg Leu Gly Phe Ile Tyr Glu Gly Thr Phe

85 90 95

Arg Gln Ala Val Val Tyr Lys Gly Arg Thr Arg Asp Met Asp Trp Leu

100 105 110

Ser Met Ile Asp Lys Asp Trp Pro Lys Val Lys Asp Arg Leu Glu Thr

115 120 125

Trp Leu Arg Pro Glu Asn Phe Asp Lys Asn Gly Gln Gln Tyr Lys Ser

130 135 140 

Leu Arg Glu Leu 
145 

(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466:

Met Met Thr Ser Asp Phe Phe Glu Val Trp Phe Gln Lys Phe Leu Leu

1 5 10 15

Pro Thr Leu Thr Thr Pro Ser Val Ile Ile Val Lys
20 25 

(2) INFORMATION FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:

Met Val Ser Thr Lys Thr Gln Ile Ala Gly Phe Glu Phe Asp Asn Cys

1 5 10 15

Leu Met Asn Ala Ala Gly Val Ala Cys Met Thr Ile Glu Glu Leu Glu

20 25 30

Glu Val Lys Asn Ser Ala Ala Gly Thr Phe Val Thr Lys Thr Ala Thr

35 40 45

Leu Asp Phe Arg Gln Gly Asn Pro Glu Pro Arg Tyr Gln Asp Val Pro

50 55 60

Leu Gly Ser Ile Asn Ser Met Gly Leu Pro Asn Asn Gly Leu Asp Tyr
65 70 75 80

Tyr Leu Asp Tyr Leu Leu Asp Leu Gln Glu Lys Glu Ser Asn Arg Thr

85 90 95

Phe Phe Leu Ser Leu Val Gly Met Ser Pro Glu Lys Pro Ile Leu Phe
100 105 110



(2) INFORMATION FOR SEQ ID NO: 468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 



Met Thr Tyr Leu Met Phe Ser Gly Leu Asp Glu Asn Phe Tyr His Phe 

1 5 10 15

Pro Trp Glu Val Phe Ala Gly Phe Gly Ile Ile Phe Leu Ala Cys Pro
20 25 30 

Glu Lys Val 
35 



(2) INFORMATION FOR SEQ ID NO: 469:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469:

Met Phe Thr Gly Trp Phe Ala Gln Thr Phe Leu His Gln Phe Ile Arg

1 5 10 15

Gly Ala Trp Gly Leu Gly Phe Met Ile Phe Ile Ala Phe Pro Met Gly

20 25 30 

Lys Glu Leu Leu Glu Gly Glu Tyr His Glu His Asp 
35 40 



(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 



Met Gln Glu Lys Ile Leu Val Thr Gly Gly Ala Gly Phe Ile Gly Thr

1 5 10 15

His Thr Val Ile Glu Leu Ile Gln Ala Gly His Gln Val Val Val Val

20 25 30

Asp Asn Leu Val Asn Ser Asn Xaa Lys Ser Leu Glu Val Val Glu Arg

35 40 45

Ile Thr Gly Val Glu Ile Pro Phe Tyr Glu Ala Asp Ile Xaa Asp Thr

50 55 60

Asp Thr Leu Arg Asp Ile Phe Lys Gln Glu Glu Leu Thr Gly Val Ile
65 70 75 80

His Phe Ala Gly Leu Lys Ala Val Gly Glu Ser Thr Arg Ile Pro Leu

85 90 95

Ala Tyr Tyr Asp Asn Asn Ile Ala Gly Thr Val Ser Leu Leu Lys Ala

100 105 110

Met Glu Glu Asn Asn Cys Lys Asn Ile Ile Phe Ser Ser Xaa Ala Thr

115 120 125

Val Tyr Gly Asp Pro Tyr Thr Val Pro Ile Leu Glu Val Ser His Phe
130 135 140

Gln
145 



(2) INFORMATION FOR SEQ ID NO: 471:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:

Met Val Lys Glu Lys Val Thr Leu Thr Pro Leu Ala Ser Gly Tyr Gln

1 5 10 15

Ile Gly Glu Glu Glu Phe Glu Gln Val Ile Leu Ala Xaa Gly Ala Trp

20 25 30

Leu Gly Asp Met Leu Glu Pro Leu Gly Tyr Glu Val Asp Val Arg Pro

35 40 45

Gln Lys Gly Gln Leu Arg Asp Tyr Gln Leu Val Gln Asp Met Glu Ala

50 55 60

Tyr Pro Val Val Met Pro Glu Gly Glu Trp Asp Leu Ile Pro Phe Ala
65 70 75 80

Gly Gly Lys Leu Ser Leu Gly Ala Thr His Glu Asn Asp Met Gly Phe

85 90 95

Asp Leu Thr Val Asp Glu Thr Leu Leu Gln Gln Met Glu Glu Ala Thr

100 105 110

Leu Thr His Tyr Leu Ile Leu Ala Glu Ala Thr Ser Lys Ser Glu Arg

115 120 125

Val Gly Ile Arg Ala Tyr Thr Ser Asp Phe Ser Pro Phe Phe Gly Gln

130 135 140

Val Pro Asp Leu Thr Gly Val Tyr Ala Ala Ser Gly Leu Gly Ser Ser
145 150 155 160

Gly Leu Thr Thr Gly Pro Ile Ile Gly Tyr His Leu Ala Gln Leu Ile

165 170 175

Gln Asp Lys Glu Leu Thr Leu Asp Pro Glu Asn Tyr Pro Ile Glu Asn

180 185 190 

Tyr Val Lys Arg Val Lys Ser Glu 
195 200 

(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 105 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

Met Glu His Ile Ile Tyr Gln Phe Glu Glu Asp Leu Ala Ile Leu Thr
1 5 10 15

Leu Asn Arg Pro Glu Val Ala Asn Gly Phe His Ile Pro Met Cys Glu
20 25 30

Glu Ile Leu Glu Ala Leu Thr Leu Ala Glu Glu Asn Pro Ala Val His
35 40 45

Phe Ile Leu Ile Asn Ala Asn Gly Lys Val Phe Ser Val Gly Gly Asp
50 55 60

Leu Val Glu Met Lys Arg Ala Val Asp Glu Asp Asp Ile Pro Ser Leu
65 70 75 80

Thr Lys Ile Ala Glu Leu Val Asn Thr Ile Ser Tyr Lys Ile Lys Gln
85 90 95

Ile Ala Lys Pro Val Phe Asn Gly Ser
100 105



(2) INFORMATION FOR SEQ ID NO: 473:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473:

Met Gln Pro Glu Val Ile Trp Leu Gln Lys Ile Ala Lys Pro Ser Gln
1 5 10 15

Met Lys Val Leu Asn Thr Thr Asp Arg Val Gln Ala Ile Lys Asp Asp
20 25 30

Val Asp Ile Ile Gln Asn Ser Leu Gln Ile Ile Asn Gln Gln Lys Glu
35 40 45

Leu Ile Lys Glu Tyr Gln Glu Asp Leu Thr Tyr Lys Phe Lys Val Leu
50 55 60

Glu Lys Asp Ile Gln Thr Arg Thr Ser Cys Asp Lys Arg Asn Ala Gly
65 70 75 80

Asn

(2) INFORMATION FOR SEQ ID NO: 474:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 135 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474:

Met Gly Ser Asn Ser Leu Thr Leu Leu Leu Lys Ile Gln Ala Thr Ile
1 5 10 15

Pro Arg Asn Ile Tyr Asp Asn Leu Gln Ala Gly Ser Lys Val Thr Leu
20 25 30

Asp Arg Ala Tyr Gly His Met Ile Ile Glu Glu Gly Arg Glu Asn Gln
35 40 45

Val Trp Ile Ala Gly Gly Ile Gly Ile Thr Pro Phe Ile Ser Tyr Ile
50 55 60

Arg Glu His Pro Ile Leu Asp Lys Gln Val His Phe Tyr Tyr Ser Phe
65 70 75 80

Arg Gly Asp Glu Asn Ala Val Tyr Leu Asp Leu Leu Arg Asn Tyr Ala
85 90 95

Gln Lys Asn Pro Asn Phe Glu Leu His Leu Ile Asp Ser Thr Lys Asp
100 105 110

Gly Tyr Leu Asn Phe Glu Gln Lys Lys Cys Pro Asn Met Gln Pro Ser
115 120 125

Ile Cys Val Val Leu Phe Leu
130 135

(2) INFORMATION FOR SEQ ID NO: 475:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 213 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

Met Phe Phe Val Leu Ala Tyr Ala Lys Ala Val Ala Gly Arg Asn Ile
1 5 10 15

Lys Leu His Pro Ile Gln Arg Gln Gly Ala Gly Tyr His Ser Val Asn
20 25 30

Lys Asp Val Asp Phe Ala Ser Ala Thr Ala Leu Arg Gln His Gln Lys
35 40 45

Asp Gln Asp Phe Leu Glu Arg Phe Met Pro Ser Val Ala Leu Phe Glu
50 55 60

Gln Ala Ser Lys Val Ile Trp Glu Asp Tyr Phe Pro Leu Leu Arg Tyr
65 70 75 80

Gln Ile Leu Ser Asn Pro Asp Leu Thr Thr Ile Tyr Gln Val Asn Gln
85 90 95

Glu Met Ala Val Arg Ile Lys Glu Ala Ile Lys Thr Ala Gln Ser Val
100 105 110

Glu Glu Leu Val Glu Leu Val Thr Thr Lys Arg Tyr Thr Lys Ala Arg
115 120 125

Val Arg Arg Leu Leu Thr Tyr Ile Leu Met Gln Ala Arg Glu Ser Asp
130 135 140

Leu Pro Glu Ala Ile His Val Leu Gly Phe Thr Glu Lys Gly Arg Gln
145 150 155 160

His Leu Lys Ser Leu Lys Gly Gln Val Asn Leu Val Ser Arg Ile Gly
165 170 175

Lys Glu Pro Trp Asp Ala Met Thr Gln Lys Val Asp Gln Ile Tyr Gln
180 185 190

Leu Gly Lys Pro Ser Ile Ala Glu Gln Asn Phe Gly Arg Val Pro Ile
195 200 205

Arg Ile Glu Thr Asn
210

(2) INFORMATION FOR SEQ ID NO: 476:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 76 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

Met Leu Gly Ser Met Phe Val Gly Leu Leu Val Gly Phe Leu Ala Gly
1 5 10 15

Ala Met Thr Asn Arg Gly Glu Arg Met Gly Cys Phe Gly Lys Met Phe
20 25 30

Leu Gly Trp Ile Gly Ala Phe Leu Gly His Leu Leu Phe Gly Thr Trp
35 40 45

Gly Pro Val Leu Ser Gly Thr Ala Ile Ile Pro Ala Val Leu Gly Ala
50 55 60

Met Ile Val Leu Ala Ile Phe Trp Arg Arg Gly Ser
65 70 75

(2) INFORMATION FOR SEQ ID NO: 477:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 54 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:

Met Arg Ser Ser Pro Ser Arg Glu Ile Gln Pro Arg Ser Trp Gly Ile
1 5 10 15

Thr Thr Val Tyr Val Thr His Asp Gln Glu Gly Ala Met Ala Ile Ser
20 25 30

Asp Gln Ile Ala Cys Tyr Glu Arg Trp Gly Asp Pro Thr Asn Arg Pro
35 40 45

Thr Lys Arg Thr Val Ser
50

1. An isolated polynucleotide comprising a polynucleotide sequence selected from
the group consisting of:

(a) a polynucleotide having at least a 70% identity to a polynucleotide encoding a
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID
NO:224 through 477;

(b) a polynucleotide which is complementary to the polynucleotide of (a);

(c) a polynucleotide having at least a 70% identity to a polynucleotide encoding the
same mature polypeptide expressed by a gene comprising a polynucleotide sequence selected
from the group consisting of SEQ ID NO:1 through 223 contained in the DNA of the deposited
clone; and

(d) a polynucleotide comprising at least 15 sequential bases of the polynucleotide
of (a), (b) or (c).

2. The polynucleotide of Claim 1 wherein the polynucleotide is DNA.

3. The polynucleotide of Claim 1 wherein the polynucleotide is RNA.

4. The polynucleotide of Claim 2 comprising the nucleic acid sequence set forth in
SEQ ID NO:1.

5. The polynucleotide of Claim 2 comprising a nucleotide sequence selected from
the group consisting of SEQ ID NO:1 through 223.

6. The polynucleotide of Claim 2 which encodes a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:224 through 477. 

7. A vector comprising the polynucleotide of Claim 1.

8. A host cell comprising the vector of Claim 7. 

9. A process for producing a polypeptide comprising: expressing from the host
cell of Claim 8 a polypeptide encoded by said DNA.

10. A process for producing a novel polypeptide or fragment comprising
culturing a host of claim 8 under conditions sufficient for the production of said polypeptide
or fragment.

11. A polypeptide comprising an amino acid sequence which is at least 70%
identical to an amino acid sequence selected from the group consisting of SEQ ID NO:224
through 477.

12. A polypeptide comprising an amino acid selected from the group consisting of
SEQ ID NO:224 through 477.

13. An antibody against the polypeptide of claim 11.

14. An antagonist which inhibits the activity or expression of the polypeptide of
claim 11.

15. A method for the treatment of an individual in need of a Streptococcal
polypeptide comprising: administering to the individual a therapeutically effective amount of the
polypeptide of claim 11.

16. A method for the treatment of an individual having need to inhibit a bacterial 
polypeptide comprising: administering to the individual a therapeutically effective amount of the 
antagonist of Claim 14. 

17. A process for diagnosing a disease related to expression or activity of the
polypeptide of claim 11 in an individual comprising:

(a) determining a nucleic acid sequence encoding said polypeptide, and/or

(b) analyzing for the presence or amount of said polypeptide in a sample derived from
the individual.

18. A method for identifying compounds which interact with and inhibit or activate
an activity of the polypeptide of claim 11 comprising:

contacting a composition comprising the polypeptide with the compound to be screened
under conditions to permit interaction between the compound and the polypeptide to assess the
interaction of a compound, such interaction being associated with a second component capable of
providing a detectable signal in response to the interaction of the polypeptide with the
compound;

and determining whether the compound interacts with and activates or inhibits an
activity of the polypeptide by detecting the presence or absence of a signal generated from the
interaction of the compound with the polypeptide.

19. A method for inducing an immunological response in a mammal which
comprises inoculating the mammal with Streptococcal polypeptide of claim 11, or a fragment
or variant thereof, adequate to produce antibody and/or T cell immune response to protect
said animal from disease.

20. A method of inducing immunological response in a mammal which comprises
delivering a nucleic acid vector to direct expression of Streptococcal polypeptide of claim 11,
or fragment or a variant thereof, for expressing said Streptococcal polypeptide, or a fragment
or a variant thereof in vivo in order to induce an immunological response to produce
antibody and/or T cell immune response to protect said animal from disease.

21. An isolated Streptococcal polypeptide having one of the amino acid
sequences of SEQ ID NO:224 through SEQ ID NO:234.

22. An isolated nucleic acid encoding one of the amino acid sequences of
Claim 21 and nucleic acid sequences capable of hybridizing therewith under stringent
conditions.

23. Recombinant vectors comprising the nucleic acid sequences of 
Claim 22 and host cells transformed or transfected therewith. 

24. A method of identifying an antimicrobial compound comprising contacting
candidate compounds with a polypeptide of Claim 21 and selecting those compounds
capable of inhibiting the bioactivity of said polypeptide.

25. Antimicrobial compounds identified by the method of Claim 24. 






INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US97/05306 



A. CLASSIFICATION OF SUBJECT MATTER 
IPC(6): Please See Extra Sheet.

US CL: 536/23.7; 435/6, 69.1, 252.3, 320.1; 530/324, 350; 514/12
According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)
U.S.: 536/23.7; 435/6, 69.1, 252.3, 320.1; 530/324, 350; 514/12

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practicable, search terms used)
Please See Extra Sheet. 



C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category*   Citation of document, with indication, where            Relevant to claim No.
            appropriate, of the relevant passages

X, P        WO 96/10647 A1 (FUSO PHARMACEUTICAL INDUSTRIES,         1, 2, 7, 8, and 22
            LTD.) 11 April 1996, claim 2, pages 22-24.
Y, P                                                                3, 6, 9-12, 15, 17, 21,
                                                                    and 23-25

Y           WATSON, J. D. et al. Recombinant DNA, Second Edition.   1-3, 6-12, 15, 17,
            New York: Scientific American Books, W.H. Freeman and   and 21-23
            Company, 1992, pages 73 and 74, see entire document.

Y           KENNELL, D. E. Principles and Practices of Nucleic      1-3, 6-12, 15, 17,
            Acid Hybridization. Progress in Nucleic Acid Research   and 21-23
            and Molecular Biology. 1971, Vol. 11, pages 259-301,
            see especially pages 259-262.



[ ] Further documents are listed in the continuation of Box C.    [ ] See patent family annex.






Date of the actual completion of the international search 
10 JULY 1997 



Date of mailing of the international search report 

06 AUG 1997



Name and mailing address of the ISA/US 
Commissioner of Patents and Trademarks
Box PCT
Washington, D.C. 20231
Facsimile No. (703) 305-3230



Form PCT/ISA/210 (second sheet) (July 1992)*



Authorized officer 

JAMES MARTINELL

Telephone No. (703)308-0196 






Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet)

This international report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons:

1. [ ] Claims Nos.:

because they relate to subject matter not required to be searched by this Authority, namely:

2. [ ] Claims Nos.:

because they relate to parts of the international application that do not comply with the prescribed requirements to such
an extent that no meaningful international search can be carried out, specifically:

3. [ ] Claims Nos.:

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a).

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet)

This International Searching Authority found multiple inventions in this international application, as follows:
Please See Extra Sheet. 



1. [ ] As all required additional search fees were timely paid by the applicant, this international search report covers all searchable
claims.

2. [ ] As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment
of any additional fee.

3. [ ] As only some of the required additional search fees were timely paid by the applicant, this international search report covers
only those claims for which fees were paid, specifically claims Nos.:

4. [x] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.:
1-3, 6-12, 15, 17, and 21-23 (SEQ ID NOs 224-233).

Remark on Protest
[ ] The additional search fees were accompanied by the applicant's protest.
[ ] No protest accompanied the payment of additional search fees.

Form PCT/ISA/210 (continuation of first sheet(1)) (July 1992)*






BOX II. OBSERVATIONS WHERE UNITY OF INVENTION IS LACKING (Continued):

This application contains the following inventions or groups of inventions which are not so linked as to form a single
inventive concept under PCT Rule 13.1. In order for all inventions to be searched, the appropriate additional search
fees must be paid.

Group I. Claims 1-12, 15, 17, and 21-23, drawn to nucleic acids and polypeptides of specific sequence, vectors 
containing nucleic acids, methods of producing polypeptides, and methods of treatment using polypeptides. 

Group II. Claims 13 and 19, drawn to antibodies against any of no fewer than 254 specific sequence polypeptides.

Group III. Claims 14 and 16, drawn to antagonists of undisclosed composition of the activity of no fewer than 254
specific sequence polypeptides.

Group IV. Claims 18 and 24, drawn to methods for screening for compounds that interact with no fewer than 254
specific sequence polypeptides.

Group V. Claim 20, drawn to a method of inducing an immunological response using a nucleic acid.

Group VI. Claim 25, drawn to compounds of undisclosed composition.

The inventions listed as Groups I-VI do not relate to a single inventive concept under PCT Rule 13.1 because, under
PCT Rule 13.2, they lack the same or corresponding special technical features for the following reasons. The nucleic
acids, proteins, and methods of Group I are materially different from the antibodies of Group II, the compounds of
Group III, and the compounds of Group VI and are not required to produce either the antibodies of Group II, which
antibodies may be made with antigens other than those embraced by Group I. Additionally, the methods of Group I are
not needed to produce the compounds of either one of Groups III or VI. The compositions and methods of Group I are
not needed for the methods of either one of Groups IV or V. The antibodies of Group II are not needed for the
methods of Groups IV or V and are materially different from the compounds of Groups III and VI. The antagonists of
Group III are materially different from the compounds of Group VI because the antagonists and compounds have
differing sites of action and/or binding. Neither are the compounds of either one of Groups III or VI needed to practice
the methods of either one of Groups IV or V. The methods of Groups IV and V may be practiced independently of one
another.

This application contains claims directed to more than one species of the generic invention. These species are deemed
to lack Unity of Invention because they are not so linked as to form a single inventive concept under PCT Rule 13.1.
In order for more than one species to be searched, the appropriate additional search fees must be paid. The species are
as follows. Each of the 234 SEQ ID NOs listed is a separate species because there is no relationship between the
species.

The species listed above do not relate to a single inventive concept under PCT Rule 13.1 because, under PCT Rule 
13.2, the species lack the same or corresponding special technical features for the following reasons. There is no 
relationship among the species. 

For any Group applicant elects, a total of 10 (TEN) specified sequences will be searched and no more than 4 (FOUR)
specified sequences will be searched for each additional fee paid.



The claims are deemed to correspond to the species listed above in the following manner: SEQ ID NOs 224-233 
correspond to claims 1-3, 6-12, 15, 17, and 21-23. 

A. CLASSIFICATION OF SUBJECT MATTER: 
IPC (6):

C12N 15/31, 15/11, 15/00; C12P 21/02; A61K 38/16, 39/09; C07K 14/195, 14/315

B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base and where practicable terms used): 

MPSRCH OF: A-GENESEQ26, A-ISSUED, HIV-AA8, PIR50, SWISS-PROT34, EMBL-NEW11, GENBANK97,
GENBANK-NEW11, U-EMBL48_97, HIV-NA8, N-GENESEQ26, N-ISSUED, EST-STS, EST-STS-TWO,
EST-STS-THREE. SEQ ID NOs 224-233.

Form PCT/ISA/210 (extra sheet) (July 1992)*



